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Abstract 


Software  for  stiff  differential  equations  is  surveyed.  The 
classical  stiff  problem  is  well-served  by  existing  software  and  some 
codes  are  appearing  for  some  of  the  other  areas,  particularly  problems 
with  eigenvalues  near  the  imaginary  axis.  A  number  of  other 
difficulties  are  identified  and  the  state  of  and  possibilities  for 
software  development  in  these  areas  are  discussed. 


♦Supported  in  part  by  the  U.S.  Department  of  Energy,  Grant  DOE 
DEAC0276ER02383. 


1.  Introduction 


In  addition  to  the  large  amount  of  fine  theoretical  work  that  has 
been  done  on  stiff  equations  in  the  last  decade,  there  have  been  a  great 
number  of  new  methods  developed  for  solving  stiff  equations  in  the  last 
fifteen  years,  and  a  great  many  improvements  to  existing  methods,  this 
much  is  very  obvious  from  the  literature.  In  preparing  to  write  this 
paper  I  surveyed  a  few  people  in  major  laboratories  and  looked  over  some 
of  .the  literature  concerned  with  libraries  and  codes,  and  was  surprised 
to  find  that  in  spite  of  this  large  amount  of  work,  relatively  little  of 
it  has  found  its  way  into  the  hands  of  the  users.  In  fact,  except  for 
those  fortunate  enough  to  have  direct  access  to  some  of  the  people 
active  in  ODE  code  development,  virtually  all  users  work  with  codes 
based  on  BDF  methods  from  the  NAG  or  IMSL  libraries,  from  Hindmarsh's 
LSODE  set,  or  derivatives  of  earlier  codes  on  which  these  were  based. 
This  is  probably  no  surprise  to  users  on  the  other  side  of  the  fence,  as 
they  have  long  said  that  until  we  numerical  analysts  take  time  to  write 
good  software  and  get  it  out  to  the  users,  our  ideas  will  not  be  put 
into  action.  However,  here  in  1982  many  of  "we  numerical  analysts"  feel 
that  we  have  seen  the  light  and  now  think  of  ourselves  as  "numerlciens,” 
people  who  are  concerned  about  the  real  numerical  aspects  of  problems 
and  produce  mathematical  software  to  present  our  results.  Yet,  in  spite 
of  this,  very  few  of  the  new  methods  are  actually  in  use  and  most  of  the 
highly  used  software  is  based  on  early  methods. 


Does  this  mean  that  the  user  has  no  needs  that  are  not  being  met  by 
existing  codes?  I  don't  think  so.  The  main  development  that  has 
affected  the  average  user  over  the  last  decade  is  that  earlier  software 
has  been  greatly  improved  in  two  ways:  fine  tuning  and  user  interfaces. 
Although  there  have  been  some  codes  developed  in  that  period  which 
embody  new  methods,  they  have  not  undergone  the  careful  modification  to 
tune  them  for  user  use  such  as,  for  example,  Hindmarsh  has  applied  in 
developing  the  LSODE  set  of  codes  from  earlier  codes.  There  are  a 
number  of  user  needs  that  are  not  met  by  the  currently  highly  used 
codes.  For  some  of  these  needs  there  are  methods  but  the  software  is  in 
various  states  of  development  from  experimental  (a  euphemism  for  badly 
written)  to  what  we  might  call  preproduction  versions.  For  other  needs 
there  currently  does  not  exist  software,  or  in  some  cases  even 
reasonable  methods. 

This  paper  will  first  list  a  number  of  areas  that  are  part  of  or 
impact  on  the  solution  of  stiff  equations.  Then,  each  of  these  areas 
will  be  examined  to  see  some  of  what  is  available,  both  in  methods  and 
software,  and  where  some  of  the  currently  recognized  difficulties  lie. 

2^.  Classification  of  problems 

The  phenomenon  of  stiffness  that  first  caused  numerical  difficulty 
was  due  to  eigenvalues  with  large  negative  real  parts.  The  methods 
developed  to  solve  those  are  generally  effective  for  problems  which  are 
such  that  all  eigenvalues,  \(J),  of  J  satisfy  Arg(-X,(J))  <  60°.  For 
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example,  the  BDF  formulas  are  A(73.23°)  stable  for  orders  up  to  four, 
while  the  fifth  order  formula  Is  A(51.83°)  stable  (see  [Gaff82],  Table 
5.1).  We  will  refer  to  this  angle  as  the  argument  of  stability.  A 
sequence  of  codes  have  been  written  based  on  these  methods,  and  the 
current  versions  of  these  codes  are  the  ones  in  common  use  today.  I 
suspect  the  reason  that  newer  methods  are  having  very  little  impact  in 
this  area  is  that  the  underlying  method  in  these  older  codes  is  within  a 
small  factor  of  being  as  good  as  one  can  hope  to  do,  and  the  codes  based 
on  these  methods  have  evolved  over  a  decade  to  be  so  finely  tuned  that 
recent  codes  based  on  slightly  better  methods  show  very  little 
improvement.  Furthermore,  the  fact  that  these  older  codes  have  been 
developed  in  a  "user-active"  environment  has  caused  them  to  be  well- 
tailored  to  user  needs  as  far  as  interfaces  go. 

However,  there  are  other  types  of  problems  that  are  frequently 
classified  as  "stiff"  and  additional  features  of  stiff  problems  that 
cause  difficulties  for  many  existing  codes.  There  are  also  a  number  of 
features  that  are  needed  in  codes  but  are  not  yet  commonly  available. 
Among  these  are: 

(i)  Lightly  damped  oscillatory  problems — they  have  large  eigenvalues 
near  the  imaginary  axis  and  for  which  the  higher-order  BDF 
methods  are  unstable  for  a  large  range  of  stepsizes. 

(ii)  Linear,  constant-coefficient  problems  for  which  more  efficient 
techniques  are  possible. 


(Ill)  Large  systems  of  equations  in  which  the  matrix  manipulation 


(iv) 

<v) 

(vi) 

(vii) 

(viii) 

(ix) 

(x) 


becomes  a  large  part  of  the  cost. 

Implicit  differential  equations  which,  if  large,  should  be 
handled  directly  and  not  converted  to  explicit  form  because  of 
the  loss  of  sparsity. 

Differential-algebraic  equations  which  are  an  extension  of 
implicit  equations  in  which  the  number  of  differential  equations 
is  less  than  the  number  of  equations  and  variables  so  that  fewer 
Initial  values  can  be  specified. 

Differential  equations  with  discontinuities — these  cause  many 
codes  to  be  very  inefficient. 

Problems  whose  solution  contains  very  high  frequency  components 
of  little  interest  but  which  cannot  be  Ignored  without 
introducing  error  because  of  non-linearities. 

Systems  of  equations  which  can  be  partitioned  into  sets  which 
change  at  very  different  speeds  and  can  then  be  handled  with 
different  stepslzes  and/or  different  methods. 

Equations  with  delayed  terms  present. 

The  need  to  estimate  global  error  in  codes — most  current  cod.es 
attempt  to  control  local  error  only. 


(xi)  The  need  Co  switch  between  different  methods  according  to  the 
nature  of  the  problem  as  it  changes  over  the  interval. 

(xii)  The  need  to  have  codes  whose  output  is  a  "smooth"  function  of 
its  inputs — for  example,  if  the  integrator  provides  data  for 
another  code  such  as  an  optimization  program,  it  is  important 
that  the  integrator  appear  to  implement  a  smooth  function  or  the 
optimizer  may  get  stuck  in  a  local  minimum  that  represents 
noise,  or  will  have  great  difficulty  making  realistic  estimates 
of  the  derivatives  of  the  output  of  the  integrator. 

The  sections  below  discuss  each  of  these  problem  areas  and  the  current 
state  of  software  development. 

3.  •  Current  software  for  the  standard  stiff  problem 

In  addition  to  the  codes  in  the  IMSL  and  NAG  libraries  and  the 

codes  of  Hindmarsh  [HindSO],  [Hind79b],  and  [Hind79a},  all  of  which  are 

widely  available,  there  are  a  number  of  newer  codes  in  various  stages  of 

development  which  may  have  an  impact  in  the  near  future.  Some  of  these 
are  discussed  in  the  sections  below  because  they  appear  to  be 
particularly  suitable  for  some  of  the  special  problems.  Among  the  other 
codes  are  two  recent  "entries  to  the  market"  based  on  techniques  not 

used  in  common  stiff  codes.  Bader  and  Deuflhard  [BaDe81]  have  developed 
an  extrapolation  code  based  on  a  semi-implicit  midpoint  rule.  This 
midpoint  rule  has  the  advantage  that  it  does  not  require  iteration  of 
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nonlinear  equations  involving  the  Jacobian  and  yet  has  a  quadratic  error 
expansion  so  that  each  extrapolation  step  adds  two  orders  of  accuracy. 
Their  program,  METAN1,  is  still  experimental  but  is  a  promising 
candidate  for  future  general  use.  A  second  program,  DSTIFF,  is  based  on 

multistep  formulas  chosen  to  provide  a  least-squares  fit  to  a  number  of 

accuracy  requirements  as  well  as  stability  requirements.  The  argument 
of  stability  reported  by  Gupta  in  [GuptSl]  exceeds  62°  all  the  way  to 
order  ten,  so  these  methods  may  also  be  useful  for  the  problems 
described  in  the  next  section. 

Mention  should  also  be  made  of  many  codes  designed  primarily  for 
initial  value  problems  in  PDEs  and  of  codes  designed  for  particular  user 
groups  such  as  people  solving  chemical  kinetic  problems,  network 
simulation  problems,  etc.  PDE  codes  often  make  use  of  ODE  codes  via  the 

method  of  lines.  In  some  cases  they  use  standard  ODE  codes.  (See,  for 

example,  [Hyma79],  [MaSi79],  [MeSi81a],  [MeSi81b],  and  [SiMa75].  For  a 
survey,  see  [MaSw80].)  Other  codes  are  applicable  to  PDEs  or  ODEs ,  or 
combinations  of  the  two.  For  example,  the  FORSIM  package  [Carv73]. 
There  are  also  many  other  codes  which  have  been  tailored  for  specific 
users,  e.g.  FACSIMILE  [CCJK77],  and  there  are  undoubtedly  many  other 
experimental  codes  of  which  I  am  unaware.  I  apologize  to  those  whose 
codes  I  have  overlooked  below;  I  would  like  to  hear  about  them. 

4_.  Lightly  damped  oscillatory  problems 

When  the  Jacobian  has  large  eigenvalues  near  to  but  to  the  left  of 


the  imaginary  axis,  the  solution  to  the  problem  does  not  exhibit 
oscillation  (except  possibly  in  a  transient  region)  provided  the  driving 
terras  are  not  adding  energy  at  the  corresponding  frequency,  but  many 
numerical  methods  will  have  difficulty  because  they  will  fail  to  damp 
the  oscillation  or  even  amplify  it.  The  BDF  methods  are  particularly 
bad  because  they  are  not  A-stable  above  second  order  and  are  strongly 
amplifying  for  eigenvalues  close  to  the  imaginary  axis  at  moderate 
distances  from  the  origin.  These  problems  are  very  similar  to  stiff 
problems  in  that  common  numerical  methods  tend  to  amplify  components 
that  are  not  present  in  the  true  solution.  Miranker  [Mira81]  has  also 
pointed  out  that  they  are  similar  in  that  both  can  be  viewed  from  the 
singular  perturbation  point  of  view.  However,  they  are  not  similar  when 
one  considers  that  the  classical  stiff  problem  is  "superstable"  and  any 
perturbation  is  rapidly  damped  in  the  actual  system,  whereas  the  lightly 
damped  oscillating  problem  is  not  superstable  and  a  perturbation  can 
cause  an  effect  in  the  solution  for  some  time.  If  the  eigenvalue  is 
very  close  to  the  imaginary  axis,  an  oscillatory  component  could  take  a 
long  time  to  damp  out,  and,  until  that  happens,  almost  all  integrators 
would  be  forced  to  take  a  small  step  to  follow  the  high  frequency 
component.  (It  is,  of  course,  possible  to  use  methods  which  are  exact 
for  certain  oscillatory  functions.  This  is  a  form  of  exponential 
fitting  and  requires  knowledge  of  the  eigenvalues  of  the  Jacobian.  Such 
methods  are  not  usually  too  useful  for  general  purpose  software.)  It  is 
tempting  to  consider  methods  that  damp  the  oscillations  rapidly  so  that 
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they  do  not  hinder  the  numerical  method.  This  is  a  satisfactory 
approach  in  those  problems  in  which  the  oscillating  term  does  not  have  a 
strongly  nonlinear  effect,  so  that  damping,  which  is  equivalent  to  a 
time  averaging,  does  not  change  the  slow  components.  A  problem  such  as 


w  »  0 


2  .  ,2 

y  +  y 


cannot  be  damped  because  the  value  of  z(t)  is  strongly  dependent  on  the 
amplitude  of  the  oscillation.  Such  problems  will  be  discussed  in  a 
later  section.  In  this  section  we  examine  those  problems  in  which  the 
oscillation  is  either  absent  or  can  be  damped  without  harm.  For  these 
problems  we  need  methods  that  are  "more  stable"  than  the  underlying 
problem  near  the  imaginary  axis  to  damp  the  unwanted  components. 


Gaffney  has  examined  several  codes  for  these  types  of  problems 
having  found  codes  based  on  the  BDF  formulas  to  be  unsatisfactory.  Four 
codes  are  compared  in  [Gaff81].  They  are  STRIDE  [BuBC79],  STINT 
[TeBP78] ,  DIRK  [Alex77],  and  BLEND  !SkKo77].  Of  these,  BLEND  and  DIRK 
are  experimental  codes  which  in  no  sense  are  in  a  form  for  the  general 
user,  having  been  developed  as  a  test  bed  for  the  underlying  ideas  in 
the  papers  cited.  STRIDE  is  an  implementation  of  implicit  Runge  Kutta 
methods  available  as  a  Fortran  or  Algol  code.  STRIDE  uses  singly- 
implicit  RK  methods  [Burr73]  which  give  stability  out  to  an  argument  of 
at  least  83°.  (STRIDE  can  also  be  used  for  nonstiff  equations  by  using 
fixed-point  iteration  instead  of  Newton's  methods  to  solve  the  nonlinear 


equations  at  each  step,  but  it  appears  that  its  strength  is  for  stiff 
equations  with  eigenvalues  near  the  imaginary  axis.)  STINT  is  an 
implementation  of  cyclic  composite  methods.  These  achieve  larger 
regions  of  absolute  stability  than  is  possible  with  simple  linear 
multistep  methods  by  applying  a  sequence  of  different  raultistep  methods. 
If  its  order  is  limited  to  four,  it  is  stable  out  to  arguments  of  almost 
81°,  far  superior  to  the  BDF  methods.  BLEND  uses  a  combination  of  Adams 
and  BDF  methods  in  a  way  that  can  give  the  stability  of  second 
derivative  methods.  The  basic  idea  is  that  if  the  Adams  method  is 
written  as  A  ■  0,  the  BDF  method  as  B  ■  0,  and  J  is  the  Jacobian,  the 
linear  combination 

A  +  J.B  -  0 

becomes  the  Adams  method  when  J  is  small,  and  the  BDF  method  when  J  is 
large.  However,  it  also  involves  a  term  of  the  form  Jy'  ir  the  J.B 
component.  For  the  linear  ODE  y'  ■  Jy  we  see  that  y"  *  Jy'*  so  this 
term  is  equivalent  to  the  second  derivative.  Consequently  it  can 
achieve  the  stability  of  second  derivative  methods — for  example,  it  can 
be  A-stable  up  to  the  fourth  order.  Furthermore,  its  accuracy  is 
obtained  from  the  accuracy  of  the  underlying  Adams  and  BDF  methods,  so 
it  does  not  depend  on  the  accuracy  of  the  Jacobian  for  integration 
accuracy;  an  error  in  the  Jacobian  only  affects  the  region  of  stability. 
Consequently,  it  is  not  necessary  to  re-evaluate  the  Jacobian 
frequently,  as  would  be  required  in  second  derivative  methods  to  be 


11 


discussed  in  a  later  section.  Table  S.l  of  [Gaff 82]  compares  the 
maximum  argument  of  stability  of  STINT,  BLEND  and  the  BDF  methods.  Even 
through  order  7,  BLEND  is  competitive  with  the  83°  of  the  STRIDE 
implementation.  Unfortunately  BLEND  has  never  been  developed  into  a 
production  code.  (We  do  have  a  version  embedded  in  LSODE  in  addition  to 
the  original  modification  of  DIFSUB,  but  It  Is  inadequately  documented.) 
DIRK  is  an  experimental  Implementation  of  diagonally  implicit  Runge- 
Kutta  methods  with  a  fixed  order.  The  diagonal  coefficients  are  chosen 
to  be  the  same  so  that  although  at  each  step  of  an  R-stage  method  it  is 
necessary  to  solve  R  systems  of  non-linear  equations  versus  the  one 
system  of  multistep  methods,  the  same  LU  factorization  can  be  used  for 
all  equations.  [Gaff82]  presents  a  table  of  recommendations  based  on 
his  tests  on  page  80  of  the  cited  document.  Since  the  method  of  choice 
varies  with  accuracy  requirements,  argument  of  stability  needs,  and  user 
demands  on  the  interface  to  the  code,  1  refer  the  reader  who  needs  to 
solve  such  problems  to  that  document  for  further  details.  (Second 
derivative  methods  may  also  be  competitive  for  these  problems  because 
they  have  a  large  argument  of  stability,  although  they  tend  to  be 
Inefficient  for  large  problems  with  a  non-constant  Jacobian.  See  the 
remarks  la  the  next  section.) 

5_.  Linear,  constant-coefficient  problems 

Although  there  are  probably  not  many  problems  in  this  category, 
considerable  savings  are  possible  when  they  occur.  Clearly  it  is  not 


necessary  to  re-evaluate  a  Jacobian  matrix,  and  techniques  to  avoid 

refactoring  can  be  exploited.  To  my  knowledge,  there  has  been  very 

little  code  development  in  this  area  except  for  the  work  by  Enright 

[Enri77].  He  modified  two  codes,  one  of  Hindmarsh's  GEAR  codes  and  one 

of  his  second  derivative  codes  SDBAS1C  [Enri74]  and  [Addi79].  A  typical 

stiff  code  has  to  solve  a  nonlinear  equation  at  each  step  and  this  Is 

done  with  a  Newton  Iteration  which  requires  solving  a  linear  system 

involving  the  matrix  (1  -  hpjj  where  J  is  the  Jacobian.  Refactorization 

is  necessary  for  two  reasons:  the  Jacobian  changes  or  the  stepsize  h 

and/or  the  coefficient  p  changes.  If  the  system  is  linear,  the  Newton 

Iteration  converges  in  one  step,  which  is  one  saving.  Furthermore,  the 

Jacobian  doesn't  change  so  it  is  necessary  to  refactor  only  when  h  or  p 

change.  This  is  only  a  shift  to  the  matrix,  but  unfortunately  there  are 

no  known  fast  techniques  for  updating  the  LU  decomposition  after  a  shift 

of  a  matrix — indeed  it  appears  that  it  cannot  be  done  any  faster  than 

the  original  decomposition  in  general — so  Enright  uses  a  different 

decomposition,  LHL  *  where  L  is  unit  lower  triangular  and  H  is  upper 

Hessenberg.  A  shift  of  this  is  obtained  by  a  shift  of  the  H,  while  H 

2 

can  be  factored  in  0(n  )  steps.  A  further  advantage  of  a  constant 
Jacobian  accrues  in  the  case  of  second  derivative  methods,  namely  that 
the  calculation  of  the  second  derivative,  done  by  forming  Jy',  does  not 
require  re-evaluation  of  the  Jacobian  at  each  step.  (It  is  that  re- 
evaluation  that  makes  the  second  derivative  methods  less  competitive 
than  they  would  otherwise  be  when  compared  to  methods  that  only  loose 


stability  but  not  accuracy  when  the  Jacobian  is  somewhat  inaccurate.) 

6_.  Large  systems  of  equations 

The  major  difficulty  with  large  problems  is  matrix  manipulation  due 
to  the  need  to  solve  linear  equations  a  little  more  than  once  per  time 
step.  As  the  size  of  the  problem  increase,  the  percentage  of  time  in 
this  part  of  the  program  increases  and  it  becomes  important  to  look  for 
fast  methods.  When  the  problem  gets  very  large,  the  storage  for  the 
Jacobian  and  its  decomposition  may  also  be  a  problem.  Fortunately,  the 
Jacobian  for  large  problems  is  almost  always  sparse  because  at  least 
part  of  the  problem  has  arisen  from  the  partial  discretization  of  a  PDE. 
The  use  of  direct  sparse  techniques  has  received  a  fair  amount  of 
attention  and  one  version  of  LSODE,  namely  LSODES  which  uses  the  Yale 
sparse  matrix  package,  is  available.  Less  attention  has  been  paid  to 
iterative  methods  which  will  be  needed  for  the  really  large  problems, 
although  much  of  the  work  done  on  iterative  methods  for  symmetric 
problems  is  applicable  when  the  system  of  ODEs  arise  from  a  method  of 
lines  approach  to  many  PDEs.  Niranker  discusses  the  use  of  conjugate 
gradient  methods  for  symmetric  Jacobians  in  his  book  [Mira81].  An 
earlier  code  of  Hindmarsh,  GEARBI,  (see  [Hind76])  uses  a  block  SOR 
method  for  problems  arising  from  the  method  of  lines  applied  to  two 
space  dimension  PDEs.  Hindmarsh  has  indicated  that  this  will  be 
upgraded  to  an  LSODE  code.  However,  in  problems  in  which  the  Jacobian 
is  non-symmetric  and  even  has  an  irregular  non-zero  structure,  most  of 


the  currently  used  methods  for  solving  equations  iteratively  are  not  too 
satisfactory.  There  is  some  work  continuing  on  this  problem,  for 
example,  [GeSa81],  and  this  has  been  put  in  a  version  of  LSODE,  but  it 
has  not  been  distributed  because  it  still  represents  an  initial  attempt. 

If  the  problem  is  not  so  large  that  indirect  methods  spill  out  of 
primary  storage,  they  can  be  used  but  can  be  very  expensive.  In  that 
case,  there  is  a  lot  of  interest  in  techniques  which  minimize  the  number 
of  times  the  Jacobian  must  be  recomputed  and/or  the  matrix  refactored. 
There  are  few  codes  generally  available  based  on  these  techniques  but 
some  of  them  are  likely  to  appear  in  future  codes.  For  example,  Krogh 
and  Stewart  are  working  on  a  new  code,  STRUT,  based  on  BDFs  for  stiff 
equations  which  goes  to  considerable  effort  to  detect  whether  the 
Jacobian  need  be  re-evaluated  or  the  matrix  refactored.  See  [KrSt81a], 
[KrSt81b],  and  this  conference.  The  "constant  leading  coefficient" 
.technique  [JaSa80]  has  been  used  to  reduce  matrix  refactorizations  by 
holding  the  8  coefficient  constant.  This  changes  as  the  stepsizes 
change  in  variable  step  methods.  By  sacrificing  a  small  amount  of 
accuracy  it  can  be  held  constant. 

Z*  Implicit  differential  equations 

An  implicit  differential  equation  is  written  in  the  form 
F(y,  y',  t)  -  0 


and  can  be  handled  directly  by  a  number  of  methods  including  BDF  and 
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certain  implicit  Runge-Kutta  methods.  A  few  codes  have  been  written 
using  the  BDF  methods  but  until  recently  they  have  not  been  widely 
distributed.  Some  of  them  have  been  embedded  in  proprietary  network 
analysis  software  [HaBG71]  so  for  a  long  time  the  only  code  available 
was  a  modified  version  of  DIFSUB  which  was  "experimental"  in  the  worst 
meaning  of  the  word  (see  [Gear71]  and  [Brow74]).  Recently,  two  better 
codes  have  appeared,  LS0DA1  [Paln81],  and  DASSL  [Petz82].  Both  of  these 
are  based  on  the  BDF  methods  which  replace  the  derivative  in  the 
implicit  equation  with  a  difference  approximation.  For  example,  the 
first  order  method  is  the  Backward  Euler  method  and  is  obtained  by 
substituting  (yn  -  yn_^)/h  for  y^  to  get  the  nonlinear  equation 

F(v  -  Vi)/h>  '„>  ■ 0 

to  solve  for  yR  at  each  step.  To  ny  knowledge,  no  codes  are  available 
based  on  implicit  Runge-Kutta  methods,  although  such  methods  would  work 
provided  that  the  final  stage  is  evaluated  at  the  end  point  of  the  step 
and  is  used  to  provide  the  output  value. 

8_.  Differential-algebraic  equations 

These  problems  arise  when  an  implicit  system  represents  fewer 
differential  equations  than  there  are  variables  and  equations  because 
the  Jacobian  dF/dy'  is  singular.  For  some  time  it  has  been  known  that 
difficulties  can  be  encountered  with  some  such  systems,  but  only 
recently  has  the  extent  of  some  of  these  difficulties  been  comprehended. 
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The  current  situation  is  that  we  do  not  know  how  to  solve  the  general 
problem,  but  we  do  know  that  all  conventional  00E  methods  break  down. 
Fortunately  the  difficulties  do  not  arise  very  frequently,  so  large 
classes  of  problems  can  be  solved  by  the  software  mentioned  in  the  last 
section,  but  it  is  possible  to  construct  examples  which  are  physically 
reasonable  but  Impossible  to  handle,  and  there  are  cases  In  which 
examples  have  arisen  in  practical  simulation  studies.  The  nature  of  the 
difficulty  can  be  seen  in  some  very  simple  model  problems.  Consider  the 
system  of  three  equations 

x  -  g(t) 
y  *  x' 
z  »  y' 

These  are  clearly  not  differential  equations:  their  solution  is  uniquely 
defined  Independently  of  Initial  values.  If  a  fixed  step  integrator  is 
applied  to  these  it  is  equivalent  to  applying  numerical  differentiation 
to  g(t).  For  example,  if  the  Backward  Euler  method  is  applied,  we 
calculate 

X1  *  81 

yl  “  <X1  "  x0)/h 
zi  *  <yi  -  y0)/h 

This  numerical  calculation  uses  initial  values  for  x  and  y  which  are 
probably  incorrect,  so  the  values  of  y^  and  z^  will  be  incorrect.  In 
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fact,  they  will  diverge  as  h  tends  to  zero.  However,  it  can  be  shown 
that,  for  fixed  stepsize  methods  such  as  BDF  methods,  after  a  number  of 
steps  the  answers  have  the  required  degree  of  accuracy.  However,  if  the 
step  varies  or  the  problem  has  time  dependent  coefficients,  we  may  run 
into  difficulty.  Precisely,  we  must  talk  about  the  index  of  nllpotency 
of  the  system.  It  is  the  maximum  size  of  a  subsystem  of  the  sort  above. 
In  the  example  above  it  is  three.  If  this  index  exceeds  one,  the  time 
dependent  problem  cannot  be  solved  by  known  methods.  If  the  index 
exceeds  two,  variable  stepsize  methods  also  fail  for  the  constant 
coefficient  case.  Details  can  be  found  in  [Petz81]  and  [GeHP81].  A 
further  report  is  also  in  preparation  by  the  same  authors. 

Although  these  problems  do  not  arise  very  often,  they  are  known  to 
arise  on  occasions.  For  example,  [Pain8l]  states  that  the  Navier  Stokes 
equations  have  an  index  of  two.  They  were  integrated  successfully  by  an 
implicit  ODE  code,  perhaps  because  the  critical  parts  of  that  problem 
are  linear.  It  would  be  highly  desirable  for  a  code  to  detect  the 
existence  of  a  high  index,  but  at  the  present  we  do  not  know  how  to  do 
that.  The  symptom  of  such  a  problem  in  an  automatic  code  is  that  the 
step  control  mechanism  gets  into  trouble  and  reduces  the  step  endlessly 
without  reducing  the  error — possibly  even  increasing  it,  so  that 
eventually  the  code  reports  that  the  error  tolerance  cannot  be  achieved, 
leaving  the  user  to  guess  the  reason.  Furthermore,  since  it  happens 
only  when  the  Jacobian  with  respect  to  y'  is  singular,  there  are 
conventional  stiff  equations  arbitrarily  close  to  these  very  hard 
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problems.  If  such  systems  are  Integrated  with  existing  integrators,  it 
is  possible  that  they  will  fail  also. 

9^  Differential  equations  with  discontinuities 

Discontinuities  in  the  derivatives  of  solutions  cause  difficulties 
in  codes  unless  they  are  handled  carefully.  The  difficulty  arises 
because  the  code  is  proceeding  under  the  assumption  that  the  solution  is 
well  approximated  by  a  smooth  form  such  as  a  polynomial.  This  is 
clearly  not  true  when  there  is  a  discontinuity  in  a  derivative  of  order 
lower  than  the  degree  of  the  polynomial.  If  a  one-step  method  is  being 
used,  the  problem  can  be  overcome  by  placing  a  mesh  point  at  the 

discontinuity  so  that  separate  smooth  approximations  are  used  on  each 
side  of  the  discontinuity.  (Even  then,  the  code  has  to  be  organized 
carefully.  If,  for  example,  a  trapezoidal  rule  is  being  used  to 
Integrate  the  problem  y'  -  f^(t)  where  i  ■  0  for  t  <  t£  and  i  -  1  for  t 

>  tc,  the  step  ending  at  tc  must  be  integrated  using  fg,  even  when  ;  '  is 

evaluated  at  t£  in  that  step.  After  that  step  has  been  completed,  the 
switch  to  fj  should  be  made.  This  means  that  the  code  to  evaluate  the 
derivatives  must  contain  a  switch  and  be  told  by  the  integrator  when  the 
discontinuity  has  been  passed.)  If  a  multistep  method  Is  being  used,  the 
order  of  the  multistep  method  must  not  exceed  the  order  of  the 

discontinuity  in  any  steps  which  use  information  from  both  sides  of  the 
discontinuity,  meaning  that  we  also  need  to  know  the  order  of  the 
discontinuity.  Alternatively,  we  can  simply  restart  a  multistep  method 


Co  avoid  Chis  problem 


From  this  it  is  apparent  that  we  need  to  know  the  location  of  the 
discontinuity  and  possibly  its  order.  If  the  discontinuities  occur  at 
knwon  values  of  the  independent  variable  t  the  problem  is  fairly 
straightforward  and  is  simply  a  matter  of  code  organization.  If  the 
discontinuities  occur  at  known  values  of  the  dependent  variables,  or 
given  functions  of  them,  it  is  necessary  to  first  locate  a  discontinuity 
by  monitoring  the  values  of  these  functions,  which  we  call  switching 
functions.  Carver's  FORSIM  code  provides  such  options  for  locating 
discontinuities.  For  a  discussion  see  [Carv77].  The  location  of  a 
particular  point  by  monitoring  functions  of  dependent  variables  goes 
back,  I  think,  to  an  early  Illiac  I  RK  code  that  was  probably  written  by 
Nordsieck  (I  don't  have  any  records  on  that,  I  just  remember  using  it 
around  1958).  It  was  also  put  into  one  of  Fred  Krogh's  pioneering 
variable-order,  variable-step  codes  under  the  name  of  the  GSTOP  feature. 

In  the  two  cases  above,  the  detection  of  a  discontinuity  is 
possible  because  the  user  provides  switching  functions  to  indicate  their 
presence.  In  the  worst  scenario,  we  do  not  even  have  that  information 
(perhaps  because  the  derivatives  are  being  evaluated  by  a  large  section 
of  code  that  cannot  be  analyzed  reasonably,  or  depend  on  tabular  data). 
Then  we  must  also  attempt  to  detect  the  presence  of  a  discontinuity. 
Thus  we  see  that  there  are  three  problems  to  handle  with 


discontinuities: 
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Detect ion, 

Location,  and 
Restarting. 

Detection  has  been  investigated  for  non-stiff  problems  in  [Ce0s81].  It 
is  not  clear  that  the  technique  used  can  be  extended  to  stiff  problems. 
Fortunately,  we  usually  have  switching  functions  to  take  care  of  thLs. 
The  second  step  is  equivalent  to  that  of  root  finding  for  the  equation 
g(y,t)  ■  0  where  y(t)  is  given  by  the  numerical  integrator.  Hlndmarsh 
will  have  a  new  version  of  LSODE,  called  LSODAR,  for  this  problem,  but 
it  is  not  yet  ready  for  distribution. 

Restarting  is  important  in  multistep  codes  because  they  are  usually 
less  efficient  in  the  first  few  steps  which  are  taken  at  low  order  and 
small  stepsize.  RK  codes  can  commence  with  a  normally  sized  step 
immediately.  The  problem  of  starting  at  a  modest  order  (four)  with  a 
reasonable  stepsize  is  examined  in  [Gear80a]  where  a  special  form  of  a 
RK  step  is  used  to  generate  enough  information  for  a  four-step  multistep 
method  to  continue.  A  code  for  the  nonstiff  method  is  available  in  a 
related  report.  There  is  a  version  of  the  technique  for  stiff  problems, 
but  as  a  general  rule  a  problem  will  not  be  stiff  after  a  discontinuity 
because  transient  components  requiring  a  small  stepsize  for  accuracy 
will  have  been  stimulated.  However,  this  is  not  always  true.  I  have 
encountered  one  problem  in  which  the  problem  remained  stiff  although 
there  were  discontinuities  fairly  frequently.  A  one-dimensional  PDE 


describing  a  heat  conducting  material  which  was  melting  as  heat 
propagated  had  been  converted  into  a  system  of  ODEs  by  the  method  of 


lines.  The  underlying  heat  equation  kept  the  problem  stiff  at  all 
times.  As  the  solid-liquid  interface  passed  over  a  line,  there  was  a 
discontinuity  because  of  the  latent  heat  effects  and  the  jump  change  in 
the  heat  conductivity.  This  problem  was  causing  difficulty  for  any  of 
the  integrators  the  user  was  trying.  (As  is  common,  I  don't  know  if  it 
was  finally  solved.  The  user  did  not  return  after  the  third  visit, 
either  because  by  suggestion  of  that  meeting  worked  or  he  decided  that  I 
was  wasting  his  time!) 

10.  Problems  with  high-frequency  components 

These  problems  are  ones  in  which  the  high  frequency  components  are 
an  integral  part  of  the  problem  and  must  not  be  damped  because  such 
damping  would  change  the  solution,  or  cannot  be  damped  because  they 
arise  from  non-linear  oscillators.  Petzold  developed  a  preliminary  code 
as  a  modification  of  DIFSUB  in  her  thesis  [Petz78].  The  underlying 
method  is  called  a  multirevolutionary  method  because  it  attempts  to  step 
over  many  cycles  (or  revolutions,  in  the  case  of  satellite  orbits  where 
the  method  was  first  investigated).  The  basic  idea  is  that  a  single 
cycle  is  investigated  numerically  by  integrating  the  original 
differential  equation  with  a  conventional  integrator.  The  results  from 
this  are  used  to  predict  the  rate  of  change  of  the  behavior  of  the 
starting  point  of  each  cycle.  If  the  behavior  of  the  starting  points  is 
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"smooth"  tt  is  possible  to  extrapolate  forward  over  many  cycles  without 
the  need  to  integrate  over  each  one.  This  extrapolation  looks  like  an 
integration  method  with  modifications  proportional  to  the  ratio  of  the 
period  to  the  stepsize.  We  are  using  generalizations  of  the  Adams  and 
BDF  methods  for  this  extrapolation.  Complications  arise  because  the 
period  may  be  changing  so  must  be  recalculated  from  the  solution,  and 
because  the  problem  may  be  very  stable  giving  rise  to  the  same  effect  as 
stiffness  in  the  multlrevolutlonary  method.  Gallivan  and  myself  have 
continued  that  work  at  Illinois  and  are  writing  a  code  embodying  a 
number  of  new  techniques  for  detecting  the  oscillatory  behavior, 
deciding  when  to  use  the  multirevolutionary  method  because  it  might  be 
more  efficient  than  a  conventional  method,  and  deciding  when  to  use  the 
generalized  BDF  rather  than  the  generalized  Adams  .  (See 
[GeGaBl] . ) 

Problems  with  more  than  one  frequency  probably  cannot  be  handled  by 
this  method,  so  these  remain  something  of  an  open  problem.  Methods 
employing  Fourier  expansions  are  one  possibility,  but  I  know  of  no 
general  software  embodying  these. 

•  A_L*  Partitioned  systems 

It  may  be  profitable  to  split  a  large  system  of  equations  into  one 
or  more  subsystems  in  order  that  each  subsystem  can  be  handled  by 
different  methods  and/or  different  stepslzes.  Different  methods  could 
be  profitably  used  if  the  stiffness  of  a  system  is  confined  to  a 


subsystem.  Different  stepsizes  could  be  used  if  the  bandwidth  of  one 
subsystem  is  significantly  less  than  that  as  another.  Even  if  the  same 
method  and  stepsize  is  used  in  all  subsystems,  gains  are  still  possible 


if  the  matrix  arithmetic  associated  with  stiffness  is  avoided  in  the  non 
stiff  subsystems.  There  are  relatively  few  codes  available  embodying 
these  techniques.  [SodeflOJ  describes  a  code,  DASP3,  which  uses  a 
conventional  four-stage  RK  explicit  method  for  one  subsystem,  and  a 
three-step  BDF  method  for  the  other.  It  is  necessary  for  the  user  to 
partition  the  system  into  the  form 

y'  -  f(y,z,t) 
ez'  *  g(y,z,t) 

a  priori.  The  code  does  rot  make  this  determination  automatically. 
Various  approaches  to  using  different  stepsizes  are  discussed  in 
[Gear80],  and  a  code  which  will  allow  a  system  to  be  divided  into  a 
number  of  subsystems,  each  to  be  handled  by  different,  automatically 
chosen,  stepsizes  and  orders,  is  begin  written  by  D.  Wells  at  the 
University  of  Illinois  as  part  of  his  Ph.D.  thesis.  This  code  also 
requires  the  user  to  do  the  partitioning.  Of  course,  one  approach  to 
automatic  partitioning  is  to  treat  each  equation  as  a  separate  subsystem 
and  leave  it  free  to  select  its  own  stepsize,  order,  and  even  method 
(e.g.  stiff  versus  nonstiff).  Unfortunately,  it  appears  that  the 
overhead  of  such  an  approach  will  be  so  large  that  any  savings  in  the 
actual  integration  will  be  negated. 
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One  difficulty  with  partitioning  is  that  it  is  based  on  the  hope 
that  certain  properties  are  confined  to  subsystems.  In  general,  this  is 
not  true.  For  example,  in  a  linear  system  in  which  the  Jacobian  is  not 
reducible,  the  dominant  eigenvectors  (those  corresponding  to  the  large 
eigenvalues)  will  have  components  in  all  dependent  variables,  so  that 
stiffness  cannot  be  confined  to  a  subsystem.  If  a  transformation  to  a 
basis  of  eigenvectors  were  to  be  made,  we  could  confine  stiffness  to  a 
subsystem,  but  this  is  not  usually  practical.  However,  it  is  not 
necessary  to  have  a  basis  of  eigenvectors,  it  would  suffice  to  have  a 
basis,  part  of  which  spanned  the  dominant  eigenspace  and  the  other  part 
of  which  spanned  the  remainder.  (This  is  really  what  the  iterative 
methods  are  attempting  to  do  in  [GeSa81}].)  An  approach  to  this  is  the 
technique  in  [EnKa79]  in  which  the  iteration  matrix  is  decomposed  into 
an  LHL  ‘  form  where  H  is  upper  Hessenberg  and  L  is  a  combination  of  a 
lower  triangular  matrix  and  permutations  chosen  to  put  the  large 
components  into  the  first  part  of  H.  In  effect,  the  lower  part  of  H  is 
ignored,  corresponding  to  ignoring  components  due  to  the  non-dominant 
eigenspace.  In  practice,  the  reduction  is  terminated  after  the  large 


components  have  been  handled. 

Since 

this  matrix  is  used 

to 

solve 

the 

linear  equations  arising  in 

the 

Newton  iteration, 

the 

effect 

of 

discarding  the  small  part  of  H  is  to  handle  the  stiff  components  by  a 
Newton  step,  and  the  nonstiff  components  by  a  functional  iteration  step. 
In  [GeSa81]  it  is  shown  how  this  can  be  used  to  treat  the  stiff 
components  with  BDF  and  the  non-stiff  components  with  Adams  methods. 
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However,  there  are  no  distributed  codes  based  on  these  techniques  to  my 
knowledge. 

12 .  Delayed  equations 

Delayed,  or  retarded,  differential  equations  can  also  exhibit 
stiffness.  There  has  been  some  work  recently  on  codes  for  these 
problems.  Among  these  are  codes  based  on  STFDIF,  a  composite,  block, 
one-step  method  by  the  group  at  Syracuse  (see  the  paper  by  Bickart  at 
this  conference),  and  a  code  based  on  DIFSUB  by  Roth  and  Watanabe  at 
Illinois  ( [Roth80]  and  [WaRo82]). 

13.  Global  error  estimation  and  control 

Most  codes  attempt  to  control  no  more  than  an  estimate  of  the  local 
error,  that  is,  the  error  introduced  in  a  single  step.  If  "error-per- 
unit-step"  control  is  used,  the  stepsize  is  adjusted  so  that  the  error 
permitted  is  believed  to  be  less  than  a  tolerance,  TOL,  scaled  by  the 
stepsize  used.  Then,  if  there  is  no  error  growth  due  either  to  the 
problem  or  the  method,  the  error  aFter  an  interval  of  length  L  should  be 
bounded  by  L*TOL.  However,  the  actual  error  depends  on  the 
amplification,  or  otherwise,  of  the  errors  by  the  differential  equations 
and  method.  Some  codes  have  been  written  for  the  nonstiff  problem  to 
estimate  global  errors,  for  example,  GERK  [ShWa78].  When  the  problem  is 
stiff,  errors  introduced  in  the  early  stages  into  the  stiff  components 
decay  due  to  the  super  stability  of  the  problem  (provided  the  method 
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also  exhibits  the  same  stability).  In  that  case,  the  best  stepsize 
strategy  is  not  an  error-per-unit  step  strategy,  but  something  closer  to 
an  error-per-step  strategy  In  which  the  local  error  estimate  is 
controlled  to  be  less  than  TOL.  Dahlquist  [Dahl81]  is  Investigating  the 
control  of  global  error  in  stiff  equations.  It  Is  an  important  problem 
because  the  relationship  between  the  user  supplied  error  control 
parameter  TOL  and  the  global  error  is  even  less  understood  by  the 
typical  user  than  in  nonstiff  problems. 

14 .  Automatic  method  switching 

Until  recently,  all  codes  either  required  the  user  to  specify 
whether  a  stiff  or  nonstiff  method  should  be  used,  or  provided  no 
choice.  Since  most  problems  are  not  stiff  everywhere  as  the  behavior  in 
an  initial  transient  is  best  handled  by  a  nonstiff  method,  it  is  highly 
desirable  to  switch  between  nonstiff  and  stiff  methods.  A  number  of 
Shampine's  codes  for  nonstlff  problems  have  detection  mechanisms  that 
warn  users  of  potential  problems  such  as  stiffness.  Stewart  has 
investigated  using  these  to  switch  automatically  from  Adams  methods  as 
implemented  in  Shampine's  codes  STEP,  to  BDF  methods  for  stiff 
equations.  The  code  STRUT  being  written  by  her  and  Krogh  is  expected  to 
Include  this  mechanism.  Petzold  has  also  investigated  techniques  for 
this  [Petz82b],  and  has  modified  a  version  of  LSDOE  with  Hindmarsh. 
Although  not  distributed  yet,  it  is  receiving  experimental  use  at 
Livermore  under  the  name  LSODA. 
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At  Illinois  we  are  using  similar  techniques  to  switch  to  and  from 
stiff  methods  in  the  code  being  written  for  oscillatory  methods.  The 
basic  idea  is  to  estimate  the  stepsizes  that  can  be  used  in  the  stiff 
method  and  in  the  non-stiff  method.  In  the  case  of  nonstiff  methods, 
the  stepsize  must  be  limited  to  keep  an  estimate  of  the  largest 
eigenvalues  inside  the  region  of  absolute  stability.  These  are  then 


divided 

into  estimates  of 

the  cost 

per 

step 

of  each 

method 

(stiff 

methods 

are  much  more 

expensive 

per 

step 

becaus  e 

of  the 

matrix 

operations),  and  the  smallest  resulting  value  is  used  to  select  the 
method.  In  our  codes  this  is  done  for  each  of  several  orders  in  Adams 
and  BDF  methods  so  that  order  selection  is  handled  at  the  same  time. 
One  difficulty  that  has  been  encountered  (and  requires  careful  tuning  of 
the  code)  arises  if  the  stepsize  in  the  nonstiff  integrator  is  not 
controlled  strongly  enough  to  keep  the  method  absolutely  stable.  In 
that  case  the  numerical  solution  exhibits  small  oscillations  at  the 
error  tolerance  level.  The  integrator  calculates  stepsize  estimates  by 
estimating  derivatives  of  the  solution  using  numerical  differentiation. 
The  oscillating  errors  give  rise  to  artificially  large  derivative 
estimates  which  cause  the  stepsize  estimates  to  be  too  small,  so  that  it 
appears  that  stability  is  not  a  problem.  Since  the  BDF  methods  have 
larger  error  coefficients  than  Adams  methods,  a  switch  is  never  made  to 
the  BDF  methods.  It  has  proved  necessary  to  ensure  that  the  stepsize  is 
kept  small  enough  in  Adams  methods  to  be  certain  that  the  method  is  well 
inside  the  region  of  absolute  stability. 


IS .  Smooth  solutions 


The  output  of  most  integrators  is  far  from  being  a  smooth  function 
of  its  inputs  because  if  a  change  is  made  to  one  of  the  inputs,  say  an 
initial  value,  the  stepsize  and  order  sequence  may  change  drastically, 
causing  a  significant  change  in  the  output.  In  applications  In  which 
the  user  is  only  concerned  with  the  difference  between  the  numerical  and 
true  solutions  this  is  not  a  problem.  However,  there  are  many 
situations  in  which  the  output  of  the  integrator  is  the  input  to  another 
program  such  as  an  optimizer.  An  example  of  this  occurs  in  the 
estimation  of  reaction  rates  in  chemical  kinetic  equations.  These  are 
often  solved  by  performing  a  least  -  squares  fit  to  observed 
concentrations  by  re-integrating  the  equations  with  different  values  of 
the  reaction  rates.  We  have  been  particularly  concerned  with  this 
problem  because  we  use  an  integrator  to  determine  the  behavior  of  one 
cycle  for  use  in  the  multirevolutionary  methods  for  oscillating 
problems. 

We  would  like  to  see  methods  developed  which  obey  certain 
smoothness  properties.  In  particular  we  would  like  methods  for  which  we 
can  get  not  only  bounds  on  the  global  error  of  the  solution,  but  we  can 
get  bounds  on  the  difference  between  the  numerical  derivative  of  the 
solution  with  respect  to  any  input  parameter  and  the  corresponding 
derivative  of  the  true  solution.  A  student,  Vu  Thu,  at  Illinois  is 
working  on  this  problem  for  ODEs.  The  critical  difficulty  is  that  a 
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code  has  Co  avoid  making  any  "absolute"  decisions  such  as  step 

rejection,  because  these  Invariably  lead  to  discontinuous  behavior.  The 
code  cannot  include  any  "if"  statements,  MAX  and  MIN  functions,  or 
similar  functions  with  discontinuous  properties. 

16.  Conclusions 

Although  there  are  a  number  of  good  codes  for  standard  stiff 

problems  there  are  many  areas  for  which  code  development  is  still 

needed,  and  other  areas  in  which  we  do  not  yet  have  adequate  methods. 
The  experience  with  the  early  codes  suggests  that  more  effort  in  code 
development  after  the  first  "production"  vers  Lon  has  been  written  is 
needed  if  a  code  is  to  become  widely  used. 
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Abstract 

The  difficulties  inherent  in  attempts  to  evaluate  methods 
designed  for  stiff  equations  are  identified  and  discussed.  It 
is  observed  that  while  evaluations  of  individual  methods  are 
worthwhile  and  often  provide  insights  into  the  method's  per¬ 
formance  attempts  to  compare  the  performance  of  different  methods 
seldom  prove  fruitful  because  of  the  many  factors  that  contribute 
to  the  effectiveness  of  a  stiff  solver.  Some  recent  progress 
and  applications  of  the  evaluation  of  stiff  methods  are  reviewed 
and  recommendations  are  made  regarding  where  evaluations  of  stiff 
methods  can  be  most  helpful. 

1.  Introduction 

During  the  past  decade  several  new  approaches  for  the 
numerical  solution  of  stiff  equations  have  been  proposed  and  many 
of  these  have  been  implemented  as  software  packages  intended  for 


general  use.  In  this  investigation  we  will  identify  some  of  the 
pitfalls  inherent  in  attempts  to  evaluate  the  performance  of 
stiff  methods  and  we  will  review  some  of  the  progress  that  has 
been  made.  We  will  distinguish  between  the  comparison  of  dif¬ 
ferent  methods  and  the  assessment  oi  an  individual  method.  One 
of  our  recommendations  will  be  that  while  the  latter  is  feasible 
and  can  be  very  worthwhile  the  former  is  controversial  and 
subject  to  many  difficulties. 

It  must  be  emphasized  that  comparing  or  assessing  stiff 
methods  should  not  be  thought  of  as  a  static  activity.  While 
it  is  true  that  attempts  to  compare  stiff  methods  has  not  lead 
to  many  definitive  statements  about  the  relative  merits  of 
various  approaches,  it  has  lead  to  a  better  understanding  of 
numerical  methods  and  has  resulted  in  improvements  and  modifica¬ 
tions  which  otherwise  may  not  have  been  possible. 

Before  discussing  the  evaluation  and  comparison  of  methods 
in  detail  we  will  review  what  a  stiff  problem  is ;  what  comprises 
a  stiff  method;  and  what  a  stiff  method  is  attempting  to  do.  In 
doing  this  we  will  identify  those  features  which  cause  the  most 
difficulty  when  one  attempts  to  compare  stiff  methods.  In  the 
next  section  we  will  clarify  what  a  stiff  problem  is  and  discuss 
particular  properties  that  a  stiff  problem  can  exhibit  which  can 
have  an  influence  on  the  performance  of  a  method.  We  will  then 
introduce  the  various  components  that  comprise  a  stiff  method 
and  discuss  how  each  can  contribute  to  the  performance  of  a 
method.  Indeed  one  must  appreciate  that  when  assessing  a  stiff 
method  one  is  not  only  assessing  the  underlying  formula  (or  class 
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of  formulas)  but  also  the  numerous  other  strategies  which  col¬ 
lectively  determine  the  method's  performance. 

In  the  fourth  section  we  will  discuss  the  objectives  of 
contemporary  stiff  methods  and  how  these  objectives  are  achieved 
We  will  discuss  in  particular  how  the  accuracy  of  a  numerical 
method  is  controlled  and  suggest  an  approach  which  could  lead  to 
a  more  meaningful  and  uniform  interpretation  of  accuracy  for 
stiff  solvers.  Other  special  features  of  problems  or  methods 
which  complicate  the  evaluation  of  a  method  will  then  be  iden¬ 
tified.  In  the  last  section  we  will  review  some  of  the  progress 
that  has  been  made  and  indicate  where  future  advances  are  most 
likely. 


2.  Properties  of  a  Stiff  Problem 


While  the  term  'stiffness'  is  widely  used  it  is  a  difficult 
concept  to  define  precisely.  Shampine  and  Gear  (1979)  discuss 
the  notion  of  stiffness  in  some  detail  and  identify  properties 
they  feel  must  be  present  for  a  problem  to  be  considered  stiff. 
It  must  be  understood  that  stiffness  is  a  characteristic  that 
depends  on  the  differential  equation,  the  length  of  integration, 
the  initial  conditions  and  the  accuracy  requirement  and  that  in 
addition  a  problem  can  be  stiff  for  parts  of  the  integration  and 
nonstiff  for  other  parts.  We  will  consider  a  problem  to  be 
stiff  if  during  its  solution  by  classical  methods  (such  as  an 
explicit  Runge  Kutta  method  or  an  Adams  multistep  method)  the 
stepsize  is  severely  constrained  by  the  numerical  stability  of 
negligible  transient  components  rather  than  the  accuracy  of  the 


significant  components.  In  particular  we  are  assuming  that  some 
component  of  the  solution  of  a  stiff  problem  must  become  negli¬ 
gible  (relative  to  the  accuracy  requirement).  These  components 
generally  will  be  significant  only  in  the  initial  or  transient 
region.  Problems  with  high  frequency  undamped  components  are  a 
special  case  and  can  only  be  considered  stiff  if  the  initial 
conditions  are  such  that  the  amplitudes  of  the  high  frequency 
components  are  small  relative  to  the  accuracy  requirement.  For 
this  class  of  problems  the  inherent  cost  of  the  integration  is 
extremely  sensitive  to  the  accuracy  requirement  and  the  specified 
initial  conditions  -  a  situation  that  is  definitely  not  typical 
of  stiff  problems.  (For  this  reason  some  people  would  not  con¬ 
sider  this  type  of  problem  to  be  stiff.) 

Stiff  problems  often  involve  large  systems  of  equations  and 
when  they  do  it  is  essential  that  any  special  structure  be  ex¬ 
ploited.  For  example  the  problem  could  have  a  sparse  Jacobian, 
it  could  possess  only  a  few  transient  components  or  it  could  be 
linear.  In  each  of  these  cases  methods  can  exploit  the  corres¬ 
ponding  structure  although  the  extent  to  which  it  can  be  exploited 
is  very  method  dependent  as  is  the  ability  to  automatically  detect 
the  existence  of  such  structure. 

3.  Components  of  a  Stiff  Method 

It  has  long  been  recognised  that  a  method  for  initial  value 
problems  in  ODEs  consists  of  much  more  than  a  formula  (or  a  class 
of  formulas)  for  advancing  the  solution  in  a  step-by-step  fashion. 
It  must  also  include  a  local  error  estimator  and  a  stepsize 
choosing  strategy  designed  to  control  the  accuracy  of  the  inte- 


gration.  In  methods  designed  for  stiff  systems  these  latter 
two  strategies  can  have  a  significant  impact  on  the  overall 
efficiency  and  reliability  of  a  method  as  can  the  iteration 
control  strategy  and  the  linear  equation  solver  that  is  used. 

Since  the  time  required  to  solve  a  stiff  problem  is  often  domi¬ 
nated  by  the  time  required  to  solve  the  systems  of  equations 
that  arise  on  each  time  step*  strategies  involved  in  this 
iteration  can  also  be  critical.  While  most  methods  solve  these 
systems  of  equations  using  a  modified  Newton  iteration  they 
differ  in  their  respective  stopping  criterion. 

Methods  which  are  not  based  on  A-stable  formulas  often 
encounter  difficulties  when  solving  problems  having  eigenvalues 
of  the  Jacobian  close  to  the  imaginary  axis.  This  difficulty  is 
often  compounded  when  the  step-choosing  strategy  is  based  only 
on  asymptotic  (h-*-0)  properties  of  the  local  error  estimator. 
Methods  that  include  an  heuristic  for  detecting  when  numerical 
stability  is  influencing  the  error  estimates  and  which  choose 
the  stepsize  (and  possibly  the  order)  accordingly  can  deal  with 
this  difficulty  in  a  more  robust  way. 

4.  Objectives  of  a  Stiff  Method 

The  methods  that  have  been  developed  for  stiff  problems 
over  the  past  decade  have  tended  to  have  similar  calling  sequences 
but  the  interpretation  of  some  of  the  parameters  can  be  quite 
different  for  different  methods.  This  is  particularly  true  of 
the  error  control  strategy  and  the  way  in  which  the  error  is 


measured. 


Methods  inevitably  attempt  to  control  some  measure  of  the 
global  error  (relative  to  the  requested  accuracy,  TOL)  by 
monitoring  and  controlling  some  measure  of  the  local  error  on 
each  step  of  the  integration.  While  the  relationship  between 
local  and  global  errors  for  stiff  systems  is  not  well  understood, 
most  methods,  primarily  on  the  basis  of  numerical  evidence, 
attempt  to  ensure  that  the  magnitude  of  the  local  error  per  step 
is  bounded  by  TOL  on  each  step.  When  this  type  of  error  control 
is  used  the  relationship  between  the  global  error  and  TOL  is 
inherently  method-dependent  as  the  bound  on  the  global  error 
depends  on  the  stepsizes  used.  Contrai'y  to  popular  belief  it 
is  not  true  that  control  of  the  magnitude  of  the  local  error  per 
step  alone  will  guarantee  control  of  the  global  error.  A  method 
which  chooses  a  sufficiently  small  constant  stepsize  h>0,  and 
generates  the  sequence  of  approximation  (x^,yi)  i=l,2,...N  with 
X£=xQ+ih,  yi=yQ  when  applied  to  the  stiff  problem 

y’  =  f(x,y),  y(xQ)  =  y0  (1) 

will  satisfy  the  local  error  per  step  criterion  but  will  produce 
a  numerical  solution  of  dubious  value.  When  interpreting  the 
accuracy  of  a  solution  produced  by  a  method  which  claims  only  to 
control  the  magnitude  of  the  local  error  on  each  step,  a  user 
must  know  more  information  (such  as  the  average  stepsize)  before 
he  is  justified  in  having  any  confidence  in  the  accuracy  of  the 
solution.  Since  most  methods  do  use  this  type  of  error  control 
they  produce  solutions  whose  global  accuracy  is  related  to  the 


specified  tolerance  in  a  very  method-dependent  fashion.  If  one 
is  only  interested  in  endpoint  accuracy  this  difficulty  is  not 
usually  as  acute,  but  if  one  wishes  accuracy  throughout  the 
interval  this  difficulty  can  make  comparison  especially  awkward. 

One  way  to  overcome  this  difficulty  would  be  to  ensure  that 
the  next  generation  of  codes  are  based  on  a  compatable  interpre¬ 
tation  of  the  parameter  TOL.  Direct  control  of  the  magnitude  of 
the  local  error  per  unit  step  would  result  in  such  a  method- 
independent  relationship  between  a  global  error  bound  and  TOL 
but  this  is  known  to  be  inefficient  for  stiff  problems.  An 
alternative  approach  would  be  to  control  the  magnitude  of  the 
local  error  per  step  but  to  combine  this  with  local  extrapolation. 
Since  extrapolation  would  alter  the  stability  properties  of  the 
method,  one  would  have  to  develop  new  formulas  or  use  a  lower 
order  predictor  in  the  local  error  estimator.  A  more  promising 
approach  which  could  also  lend  to  a  method  independent  interpre¬ 
tation  of  tolerance  is  based  on  a  variable  local  error  per  unit 
step  designed  to  control  the  magnitude  of  the  defect  of  the 
numerical  solution.  If  a  method,  when  applied  to  the  stiff 
initial  value  problem  (1),  produces  the  discrete  numerical  solution 
*xiyi^  as  an  approximation  to  (x^,y(x^))  i=l,2,...N 

and  there  exists  a  continuous  piecewise  differentiable  function 
z(x)  defined  on  [xq,Xjj]  interpolating  (x^,y^)  i=l,2,...N  such  that 

z’  = 


f(x,z)  +  6(x), 


then  6 (x)  is  the  defect  associated  with  the  approximate  solution 
z(x).  This  point  of  view  has  been  applied  successfully  in  the 
analysis  of  non-stiff  methods  (see  for  example  Hanson  and  Enright 
(1981))  and  it  is  hoped  that  it  will  prove  fruitful  in  the  analysis 
of  stiff  methods  as  well.  This  analysis  will  be  complicated  by 
the  fact  that,  to  account  for  the  very  stable  global  error  control 
observed  in  practice  one  will  have  to  consider  the  direction  of 
the  local  error  (and  hence  the  defect)  as  well  as  its  magnitude. 

We  will  return  to  this  point  in  the  next  section. 

Most  codes  provide  options  which  dictate  how  the  error  should 
be  measured  and  this  complicates  the  comparison  of  methods  as 
the  options  provided  by  different  methods  are  often  not  compatable. 
Pure  absolute  error  control  is  the  simplest  but  it  can  be  inap¬ 
propriate  for  stiff  problems  as  they  often  have  solutions  with 
components  of  widely  different  magnitude.  Componentwise  relative 
error  has  been  suggested  for  stiff  problems,  but  control  of  local 
error  in  a  relative  sense  does  not  imply  that  the  relative  global 
error  is  being  controlled  and  indeed  it  is  very  difficult  to 
say  what  effect  this  has  on  the  global  error.  On  the  other  hand, 
if  one  interprets  global  accuracy  from  the  defect  point  of  view 
then  componentwise  control  of  the  local  error  relative  to  the 
magnitude  of  the  derivative  would  lead  to  a  defect  that  is  small 
relative  to  f(x,z).  Shampine  has  recently  argued  (Shampine 
(1982))  that  for  most  problems  a  componentwise  weighted  absolute 
error  control  is  appropriate  with  the  weights  depending  on  the 
solution.  As  he  points  out  this  makes  it  awkward  for  users  to 
choose  the  appropriate  weights  without  a  few  preliminary  integra¬ 
tions,  but  for  test  problems  this  is  probably  the  best  error 


measure  to  U3e. 


5 .  Other  complications 


In  addition  to  the  difficulties  discussed  above  that  arise 
because  of  inherent  properties  of  stiff  problems,  stiff  methods 
or  the  error  control  strategy  there  are  other  difficulties  that 
can  arise  which  complicate  any  attempt  to  compare  stiff  methods 
some  stiff  problems  have  an  unstable  manifold  of  solutions  close 
to  the  solution  that  is  being  sought.  For  example  it  is  well 
known  that  in  stiff  problems  arising  from  chemical  kinetics  the 
problems  are  stable  and  well  conditioned  provided  the  concentra¬ 
tions  remain  nonnegative,  but  a  problem  can  become  mathematically 
unstable  if  a  negative  concentration  is  introduced.  During  a 
numerical  solution  it  is  possible  that  a  small  negative  concen¬ 
tration  (relative  to  the  accuracy  requirement  or  the  round-off 
level)  will  be  introduced  and  this  can  lead  to  an  unstable 
trajectory  being  followed  (Edsberg  (1976)  discusses  this  dif¬ 
ficulty  in  more  detail).  Although  some  methods  may  be  more  prone 
to  this  difficulty  than  others  it  is  not  clear  that  it  should 
be  considered  a  fault  of  the  method  as  long  as  the  specified 
error  control  strategy  is  being  respected.  Certainly  one  instance 
of  this  difficulty  can  distort  a  comparison  of  methods. 

Recently  some  new  techniques  for  stiff  equations  have  been 
proposed  for  which  even  linear  analysis  is  difficult.  The 
familiar  stability  definition  such  as  A-stability  and  A(a)- 
stability  are  based  on  the  assumption  that  analysing  the  scalar 
equation  y'=Xy  for  each  eigenvalue  \  of  the  matrix  A  will  allow 
one  to  characterize  the  stability  and  propogation  of  local  errors 
for  the  stiff  system,  y’=Ay.  This  assumption  is  valid  for  most 


10 


stiff  methods  since  the  nonsingular  similarity  transformation 
which  reduces  A  to  its  Jordan  canonical  form  will  also  'uncouple* 
most  methods  and  reduce  the  original  problem  to  an  equivalent 
sequence  of  scalar  problems  (of  course  the  equivalent  error 
measures  will  change  but  for  a  given  sequence  of  stepsizes  and 
a  fixed  formula  the  solutions  will  be  equivalent).  Some  *$pcent 
approaches  such  as  those  based  on  nonlinear  combinations  of 
the  derivative  (such  as  those  suggested  by  Lambert  (1974))  or 
those  that  use  different  formulas  for  different  components  of 
the  system  (such  as  those  suggested  by  Hofer  (1976)  or  Soderlind 
(1981))  do  not  satisfy  this  assumption.  When  a  method  does 
satisfy  this  assumption  then  it  is  clear  that  when  one  is  in 
the  transient  region  |hA^|<<  for  eigenvalues  Ai  corresponding 
to  the  smooth  components  and  consequently  there  will  be  virtually 
no  error  (except  for  round-off)  in  the  direction  of  these  com¬ 
ponents  with  the  local  error  almost  entirely  in  the  subspace 
spanned  by  the  transient  components.  Similarly  after  the  tran¬ 
sient  region  is  passed  and  the  method  is  in  the  smooth  region  we 
should  have  |hAj|>l  for  eigenvalues  A  ^  corresponding  to  the 
negligible  transients  of  the  numerical  solution.  For  these  com¬ 
ponents  stability  is  ensured  and  their  magnitude  will  remain 
negligible  relative  to  the  accuracy  requirement  and  hence  the 
local  error  must  be  mostly  in  the  direction  of  the  smooth  com¬ 
ponents.  It  is  not  obvious  that  this  will  also  be  the  case  for 
methods  which  don't  satisfy  this  assumption. 

Some  methods  require  an  analytic  Jacobian  while  others  can 
use  either  an  analytic  Jacobian  or  an  approximate  Jacobian 
based  on  numerical  differences.  Even  when  an  approximate  Jacobian 


is  allowed  the  performance  of  a  method  can  be  very  sensitive  to 
the  stepsize  used  in  determining  the  approximate  Jacobian  (see 
Gaffney  (1982))  and  this  can  be  very  disconcerting. 

6.  Progress  that  has  been  made. 

Any  comparison  of  stiff  methods  will  be  based  on  monitoring 
the  performance  of  methods  as  they  solve  a  representative  col¬ 
lection  of  stiff  problems  over  a  range  of  tolerances.  It  is 
clear  that  one  should  include  measures  of  reliability  as  well  as 
efficiency  when  describing  the  performance  of  a  method  and  it  is 
also  clear  that  there  should  be  several  measures  of  efficiency 
available.  The  relative  importance  of  these  efficiency  measure¬ 
ments  will  depend,  among  other  things,  on  the  size  and  com¬ 
plexity  of  the  differential  equation.  From  our  discussion  of 
the  ’objectives’  of  a  stiff  method  it  should  be  apparent  that 
the  choice  of  an  appropriate  reliability  measure  is  a  contro¬ 
versial  issue  and  while  a  few  possibilities  exist  none  is 
entirely  satisfactory.  In  our  early  comparisons  of  stiff  methods 
(Enright  et  al.  (1975))  we  gathered  a  collection  of  test  problems 
proposed  measures  of  reliability  and  efficiency,  and  reported  a 
comparison  of  several  modified  versions  of  existing  methods. 

The  modification  to  existing  methods  were  made  necessary  because 
of  the  variable  local  error  per  unit  step  control  we  assumed  in 
order  to  justify  our  choice  of  reliability  measure.  It  was 
clear  to  us  that  while  this  type  of  testing  could  be  used  to 
identify  weaknesses  of  a  particular  method  or  to  compare  modified 
versions  of  the  same  code  it  was  not  possible  to  identify  the 
'best*  method  in  any  sense.  Our  primary  goal  in  that  study  was 
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to  establish  a  testing  methodology  which  could  be  used  to  com¬ 
pare  the  performance  of  methods  on  any  class  of  stiff  problems. 
It  is  rather  unfortunate  that  the  main  impact  of  that  investi¬ 
gation  seems  to  have  been  that  the  test  problems  we  used  have 
been  used  as  a  'representative  set  of  stiff  problems'  in  several 
subsequent  investigations.  This  was  not  our  intention  as  the 
problem  set  was  proposed  more  to  illustrate  and  justify  the 
methodology  we  were  introducing.  Shampine  (1982)  has  recently 
made  several  suggestions  and  corrections  which  would  certainly 
be  necessary  if  these  test  problems  are  to  be  used  as  a  standard 
test  set. 

After  our  initial  testing  it  became  clear  that  while  com¬ 
parisons  of  methods  was  very  difficult  and  controversial  (since 
it  involved  modifying  methods)  the  assessment  of  an  individual 
method  was  feasible  and  much  less  controversial.  In  Enright 
(1980)  we  reported  on  the  use  of  testing  tools  we  had  developed 
to  automatically  assess  an  unmodified  stiff  method  on  any  class 
of  stiff  problems.  The  measurements  of  reliability  were  method- 
dependent  and  various  levels  of  detailed  statistics  could  be 
provided.  This  package  of  testing  tools  has  been  widely  distri¬ 
buted  and  results  obtained  using  this  package  for  a  selection 
of  contemporary  stiff  methods  are  summarized  in  Addison  (1982). 
As  Addison  notes  there  are  still  few  methods  which  operate 
efficiently  at  stringent  tolerances  (essentially  only  those 
based  on  variable-order  multistep  formulas)  while  at  larger 
tolerances  several  good  methods  are  available.  It  is  also  clear 
from  Addison's  results  that  a  robust  implementation  of  the  BDF 
formulas  such  as  the  well  known  code  GEAR  (Hindmarsh  (1973))  is 
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still  the  benchmark  that  new  codes  should  be  compared  to.  Its 
main  well-known  weakness  is  on  problems  with  eigenvalues  of  the 
Jacobian  near  the  imaginary  axis  and,  for  this  class  of  problems, 
several  acceptable  alternatives  are  available. 

In  the  future  we  feel  there  are  three  important  applications 
where  the  detailed  evaluation  of  a  stiff  method’s  performance 
will  prove  invaluable.  The  first  is  when  one  has  a  class  of 
problems  of  a  special  type  and  he  wishes  to  choose  an  appropriate 
method  which  will  then  be  adopted  for  use  in  his  application. 

An  interesting  example  of  such  a  study  is  that  of  Gaffney  (1982) 
where  he  reports  on  the  performance  of  a  selection  of  methods  on 
a  particular  stiff  oscillatory  problem. 

The  second  application  involves  the  comparison  of  different 
versions  of  the  same  basic  code  or  methods  with  very  similar 
strategies  and  objectives  on  general  stiff  problems.  The  former 
situation  arises  frequently  when  one  is  developing  a  method  and 
an  example  of  the  latter  is  the  comparison  of  GEAR  and  EPISODE 
(Byrne  et  al.  (1977)).  The  third  application  is  in  connection 
with  quantifying  the  costs  involved  and  the  potential  advantages 
in  exploiting  structure  when  solving  problems  of  a  special  type. 

As  an  example  of  such  an  application  we  have  used  (Kamel  and 
Enright  (1980))  the  testing  tools  described  above  to  quantify 
the  advantages  possible  when  one  automatically  partitions  a 
problem  with  only  a  few  transient  components  and  subsequently 
exploits  this  partitioning  in  the  linear  algebra  modules. 

In  summary  we  feel  that  while  a  comparison  of  stiff  methods 
on  general  stiff  problems  is  not  feasible,  one  can  compare  methods 
on  a  special  type  of  stiff  problems  or  versions  of  similar 
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methods  on  general  stiff  problems  with  far  fewer  difficulties. 
Automatic  testing  tools  and  the  collection  of  standard  test 
problems  can  make  this  a  straightforward  painless  process  which 
could  prove  essential  in  many  applications. 
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ABSTRACT 


Difficult  stiff  system  problems  encountered  at  LLNL  are  typified  by  those  arising 
from  various  atmospheric  kinetics  models,  which  include  reaction  kinetics  and  transport  in 
up  to  two  space  dimensions.  Approaches  devised  for  these  problems  resulted  in  several 
general  purpose  stiff  system  solvers.  These  have  since  evolved  into  a  new  systematized 
collection  of  solvers,  called  ODEPACK,  based  on  backward  differentiation  formulas  in  the 
stiff  case.  A  model  kinetics-transport  problem  is  used  to  illustrate  the  various  solvers. 
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Stiff  System  Problems  and  Solutions 
AtLLNL 


1.  introduction 

Initial  value  problems  for  systems  of  ordinary  differential  equations  (ODE's)  have  long 
been  a  topic  of  great  interest  at  LLNL.  Stiff  systems  are  particularly  prevalent  and  are, 
of  course,  much  more  challenging.  Applications  giving  rise  to  stiff  ODE  systems  vary- 
widely.  But  one  area  that  typifies  the  difficulties  encountered  is  that  of  atmospheric 
computer  models,  on  which  a  great  deal  of  effort  has  been  spent  at  LLNL  since  about 
1971.  These  problems,  in  most  cases,  take  the  form  of  systems  of  partial  differential 
equations  (PDE's)  in  space  ana  time,  involving  chemical  kinetics  and  transport  processes. 
A  discretization  process  leads  to  large  stiff  ODE  systems.  In  Section  2,  this  class  of 
problems  and  the  various  approaches  pursued  for  their  solution  are  described. 

The  problem  features  discussed  here  are  not  at  all  unique  to  this  particular 
application,  nor  to  problems  at  LLNL,  and  the  software  developed  for  their  solution  was 
designed  with  full  awareness  of  that  fact.  Thus  the  ODE  solvers  used  were  designed  to  be 
as  much  general  purpose  as  possible.  However,  a  great  deal  has  been  learned  in  the 
intervening  years  about  methods,  algorithms,  and  software  design  for  general  ODE 
solvers.  As  a  result,  a  new  collection  of  initial  value  solvers  has  recently  evolved  at 
Livermore— the  ODEPACK  collection.  There  are  currently  five  solvers  in  the  collection. 
They  are  based  on  Adams  methods  (nonstiff  case)  and  on  the  backward  differention  formula 
(BDF)  methods  (stiff  case),  and  also  on  an  inter-laboratory  effort  to  set  user  interface 
standards  for  initial  value  solvers.  These  are  described  in  Section  3. 

In  Section  4,  a  model  problem  on  kinetics-transport  type  is  used  to  illustrate  the 
capabilities  and  relative  merits  of  various  solvers,  including  both  those  in  ODEPACK  and 
older  cooes. 
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2.  Atmospheric  Model  Problems 

The  computer  modelling  of  various  atmospheric  chemistry  and  transport  process  has 
been  of  great  interest  in  the  context  of  (a)  ozone  depletion  from  supersonic  transport 
exhausts  in  the  stratosphere  [1,2],  (b)  stratospheric  ozone  aepletion  from  terrestial 
fluorocarbon  sources  13),  (c)  regional  air  pollution  in  the  lower  atmosphere  [4],  among 
others,  in  all  cases,  the  mathematical  moael  can  be  put  in  the  form  of  a  set  of 
time-aepenaent  POE's  in  space  (or,  as  a  special  case,  as  ODE'S  without  space  effects),  in 
the  concentrations  c1  of  the  various  chemical  species  of  interest.  These  PDE's  can  be 
written 

3c*/3t  =  7  •  (D7c‘  +  Vc*)  +  K‘  +  S1  (i=l,2,...,p)  ,  (1) 

where  D  is  a  matrix  of  diffusion  coefficients,  V  is  the  vector  of  mean  atmospheric 
motion,  K1  is  the  kinetics  rate  for  species  i  ,  and  S1  is  its  external  source  rate.  All  of 
these  quantities  can  in  general  depend  on  time  t,  on  the  point  in  space,  and  on  the 

1  D  T 

dependent  variable  vector  c  *  (c  ,...,c^)‘  .  However,  in  the  applications  of  interest,  D 

ana  V  ana  the  S1  aepend  only  on  space  and  possibly  time.  The  number  of  space 

dimensions  is  usually  2,  but  sometimes  1  or  0.  The  kinetics  rates  K1  usually  involve 

oiurnally  varying  rate  coefficients,  corresponding  to  photochemical  reactions  in  the  system. 

The  particular  kinetics  system  involved  varies  from  one  application  to  another,  but  in 

all  cases  there  is  a  mixture  of  fast  ana  slow  reactions  (large  ana  small  rate  coefficients), 

and  the  rates  are  nonlinear.  The  last  reactions  correspond  to  strong  damping  effects  with 

short  time  constants,  i.e.  they  cause  stiffness.  This  means  that  the  ODE  problem  will  be 

stiff  regardless  of  the  transport  processes  in  the  model,  although  they  can  also  contribute 

to  stiffness.  The  small  time  constants  from  the  kinetics  are  usually  in  the  microsecond 

range  or  smaller,  while  the  time  range  of  interest  is  usually  measured  in  years.  Thus 

1 2 

stiffness  ratios  exceeding  10  are  common  in  these  problems. 
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Historically,  the  spatial  operator  in  (1)  has  usually  been  treated  by  finite  difference 
approximations,  because  of  their  simplicity  and  their  long  history  of  use.  In  2-D,  a 
rectangular  mesh  is  usea.  Actually,  however,  there  is  nothing  inherent  in  the  problem  to 
prevent  the  use  of  a  finite  element,  collocation,  or  Galerkin  treatment,  although  in  the 
latter  case  the  local  integrals  pose  some  difficulty. 

To  illustrate  the  spatial  differencing  process,  consider  the  one-dimensional  operator 

Lc  lD(x)  g  +  V(x)c]  .  (2) 

On  a  mesh  x^  <  x^  <  ...  <xM  ,  the  standard  central  differencing  of  (2),  in  terms  of  discrete 
values  c^=  c(x-) ,  is 


Lc 


k= 


0(xi+l/2 J  Cx  (xi+l/2 )  "  D(xj-l/2)cx(xi-l/2) 


txi+l  “  Xi-lJ/2 


V<XU1)C1*1  -  V(xi-l)c1-1 
xi+l  "  xi-l 


(3) 


where,  by  definition, 

Vl/2  *  <*i  *  Vl)/2- 

-  V  • 

and  similarly  at  *  The  differencing  process  is  handled  in  a  similar  way  for  two 

dimensional  operators.  Boundary  conditions  are  similarly  approximated  in  discrete  form. 
For  example,  a  zero  normal  derivative  boundary  condition,  say 


*k 


=  0  , 


is  approximated  by  setting  c  =  c.  in  the  ODE’s  corresponding  to  xsx. 

O  A  1 
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In  this  way,  a  set  of  p  species  PDE's  becomes  a  coupled  stiff  system  of  ODE's  whose 
size  is 

N  =  p  in  0-D  (space-independent), 

N  *  pM  in  1-D  (mesh  size  M), 

N  =  pM  M  in  2-D  (mesh  size  M  by  M  ) 

a  y  *  y 

It  is  clear  that  for  a  reasonably  realistic  model  in  2-D,  the  system  size  can  easily  exceed 

10,U00. 

In  1971,  the  only  reliable  stiff  system  solver  available  (to  us)  was  C.  W.  Gear's 
DIFSUti.  This  routine  was  installed,  modified  and  improved,  ana  used  as  the  GEAR  package 
15J.  For  0-D  problems  or  small  1-D  problems,  it  workea  well,  and  producea  accurate 
answers  with  efficiencies  that  were  quite  impressive  when  compared  with  anything  else 
tried  (including  a  number  of  nonstiff  methods,  before  the  nature  of  stiffness  was 
recognized).  The  GEAR  package  (like  DIFSUB),  also  had  a  nonstiff  method  option  (using 
Adams  methods),  so  that  it  was  widely  usable  (and  used)  as  a  general  purpose  ODE  solver. 
Still,  the  GEAR  package  gave  little  hope  of  solving  the  full  atmospheric  models,  because  it 
had  to  construct,  and  perform  LU  factorizations  on,  full  N  x  N  matrices,  followed  by 
backsolve  operations  for  solutions  of  linear  systems.  If  the  ODE  system  is  written 

y  =  dy/dt  =  f(t,y) ,  (4) 

ana  has  a  system  Jacobian  matrix 

J  =  3f/3y  ,  (5) 

then  the  linear  systems  take  the  form 

Px  =  (bDF  resiaual  vector),  (6) 

where  x  is  a  correction  vector  ana  P  is  an  approximation  to  I-h0oJ  .  Here  I  denotes  the 
identity  matrix,  h  is  a  time  step  size,  and  8Q  is  a  scalar  depending  only  on  the  current 
methoa  oraer.  The  NxN  system  (6)  occurs  within  a  modified  Newton  iteration  to  solve 
the  implicit  BDF  relation. 

The  next  step  was  the  realization  that  our  problems  had  a  very  sparse  Jacobian 
matrix,  ana  that  the  LU  method  was  extendable  to  sparse  matrices.  This  was  clearest  if 
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J  was  thought  of  as  banded,  and  so  a  variant  solver,  with  J  treated  as  banded,  was 
written,  and  called  GEARB  16 J.  This  approach  could  be  applied  to  the  atmospheric 
problems  if  the  dependent  variables  in  the  ODE  system  were  correctly  ordered.  The 
appropriate  ordering  is  to  group  together  all  p  values  c1  at  one  node,  then  all  the  c1  at 
the  next  node,  and  so  on,  with  a  natural  linear  (in  1-D)  or  rectangular  (in  2-D)  ordering  of 
the  nodes.  Ordering  in  the  reverse  manner  (by  nodes,  then  by  species)  may  seem  natural, 
but  produces  much  larger  bandwidths,  as  long  as  p  is  small  compared  to  either  of  the 
mesh  dimensions.  In  two  dimensions,  the  bandwidth  is  also  minimized  by  numbering  the 
nodes  in  the  shorter  direction  first  (if  there  is  one).  GEARB  was  used  successfully  in 
solving  the  1-D  models  of  interest,  and  small  2-D  models. 

The  full  2-D  models  still  seemed  out  of  reach.  Combining  GEARB  with  reordering 
algorithms  to  reduce  bandwidth  failed  because  of  the  regular  structure  of  the  problem.  A 
general  sparse  LU  approach  seemed  inappropriate  (though  it  was  not  thoroughly  pursued) 
because  matrix  fill-in  would  result  in  much  the  same  storage  costs  as  for  the  banded 
treatment. 

Inasmuch  as  the  matrix  elements  in  (6)  are  rather  easily  generated,  the  idea  of 
iterative  linear  system  methods  was  a  natural  next  choice.  For  example,  successive 
overrelaxation  (SOR),  with  careful  attention  to  the  choice  of  relaxation  parameter,  was 
known,  both  theoretically  and  experimentally,  to  do  well  on  linear  systems  based  on 
transport  PDE's  of  the  type  (1),  with  a  single  species  and  no  kinetics.  The  implementation 
of  SOR  for  (6)  requires  that  P  be  written  as 

P  =  L  +  D  +  U  with  (7) 

L  strictly  lower  triangular, 

D  diagonal  and  nonsingular, 

U  strictly  upper  triangular, 
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that  one  can  generate  (or  access  from  memory)  the  elements  of  D  »  and  that  one  can 
easily  generate  matrix-vector  products  (L+U)x.  Specifically,  for  a  given  relaxation 
parameter  u> ,  the  vth  SOR  iteration  in  the  solution  of  Px  =  r  is  given  by 

(D  +  wL)xv+l  =  ur  +  (l-tu)Dxv  -  uUxv  .  (8) 

This  can  be  rephrased  as 
x'*1  =  (1-oj)xv  +  uz, 

Dz  =  r  -  Uxv  -  Lx'*1  . 


This  pair  of  equations  appears  circular,  but  is  not;  z  is  computed  and  xv  updated  to  xv+1 
one  component  at  a  time  according  to  („;. 

For  the  atmospheric  problems,  the  coupling  induced  by  the  kinetics  does  not  even 
come  close  to  satisfying  the  conditions  needed  for  ordinary  SOR.  Thus,  to  have  any  hope 
for  convergence,  it  is  necessary  to  treat  the  c  vector  (of  length  p)  as  a  unit,  and  use  a 
block-SOR  algorithm.  This  results  from  replacing  scalars  in  SOR  by  matrix  or  vector 
blocks  of  size  p.  Thus  one  must  write 

P  *  L  +  D  +  11  with  (10) 

L  strictly  lower  block-triangular, 

D  block  diagonal  and  nonsingular, 

U  strictly  upper  block-triangular. 

One  must  again  generate  products  (L  +  U)x  ,  and  one  must  generate  D  and  be  able  to 
solve  p  x  p  systems  with  the  diagonal  blocks  in  D.  This  can  be  easily  done  by  forming 
ana  using  LU  factorizations  of  those  blocks.  Equations  (9)  also  provide  an  algorithm  for 
the  block-SOK  iteration,  by  updating  xv  in  blocks  (from  first  to  last). 

For  a  2-D  atmospheric  model,  a  special  block-SOR  variant  of  GEAR  was  written,  and 
later  made  into  a  general-purpose  solver,  called  GEARB1  [7]  (BI  denoting  Block-Iterative). 
This  was  soon  modified  to  take  advantage  of  Large  Core  Memory  (LCM)  storage  on  the 
C DC-7600  computer,  resulting  in  a  package  called  GEARBIL.  The  latter  stores  most  of  the 
large  data  arrays  in  LCM,  the  largest  being  the  one  containing  the  matrix  D  in  (10)  (and 
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later  its  LU  decomposition),  ot  size  p  M  M  .  These  two  solvers,  as  they  stand,  are  still 

x  y 

somewhat  specialized  to  the  atmospheric  models,  in  that  the  L  and  U  matrices  in  (1U) 
have  a  regular  block  structure  (in  p  x  p  blocks),  with  each  block  being  a  scalar  multiple  of 
the  p  x  p  identity  matrix.  (The  latter  is  due  to  the  fact  that  D  ano  V  are  independent 
of  c  in  (1).)  However,  it  is  a  straightforward  matter  to  modify  the  codes  to  accommodate 
a  more  general  coupling,  and  this  has  been  done  in  at  least  two  instances. 

The  GEAKBIL  package  was  used  for  a  variety  of  studies  of  kinetics  in  the 
stratosphere,  including  ozone  depletion  from  SST's  [1,2]  and  from  fluorocarbons  [3],  with 
the  two  dimensions  being  altitude  and  latitude.  In  most  cases,  the  mesh  sizes  were  37  x 
44  (1628  spatial  nodes),  and  the  number  of  chemical  species  p  was  9,  giving  an  ODE 
system  size  of  N  =  14,652.  GEARBIL  was  subsequently  used  in  several  similar  models 
developed  for  regional  air  quality  calculations  [4],  with  the  third  dimension  (altitude) 
accounted  for  by  assuming  uniformity  between  the  ground  and  the  temperature  inversion 
layer.  One  of  these,  LIKAQ2,  is  currently  used  to  model  the  San  Francisco  Bay  Area  air 
layer,  as  an  ongoing  pollution  control  tool  of  regional  government. 

In  the  stratosphere  studies,  one  of  the  lessons  learned  concerns  the  tolerance  on  the 
part  of  the  ODE  solver  for  errors  in  the  Jacobian  matrix,  when  supplied  by  the  user.  For 
some  time,  routines  to  supply  J  were  written  by  hand,  and  therefore  subject  to  error, 
especially  for  these  complicated  systems.  When  plots  of  step  size  history  were  generated, 
and  in  one  case  published  [l,p.58],  they  often  showed  great  irregularities,  as  if  frequent 
instabilities  or  problem  discontinuities  were  forcing  drastic  reductions  in  step  size 
periodically.  Subsequently,  with  the  aid  of  automated  Jacobian  generators,  errors  in  J 
were  found  and  corrected,  and  reruns  of  these  problems  showed  a  remarkably  smooth, 
nearly  monotone,  growth  in  step  size. 

Independently  of  the  development  of  complex  transport  models  in  1-D  and  2-D,  the 
kinetics  mechanisms  of  the  lower  and  upper  atmosphere  were  studied.  In  particular,  the 
detailed  kinetics  processes  of  the  stratosphere,  with  diurnal  effects  induced,  required 


7 


much  effort  in  both  the  model-budding  and  the  numerical  solution  phases,  even  in  the 
absence  of  spatial  transport,  i.e.  in  U-D.  The  stiff  ODE  systems  that  arise  are  further 
complicated  by  the  fact  that  rate  coefficients  for  photochemical  reactions  follow  a  nearly 
square-wave  pettern  in  response  to  sunlight  [8]. 

In  many  cases,  these  diurnal  kinetics  problems  were  found  to  cause  great  difficulty  for 
the  GEAR  package,  sometimes  causing  it  to  crash  irrecoverably.  It  was  found  that  this  is 
due  to  the  buildup  of  errors  associated  with  the  use  of  fixed-step  BDF's  with  interpolator 
step  changing.  As  a  result,  variable-step  forms  of  the  BDF's  were  developed,  jointly  with 
G.  Byrne  l9j.  These  were  implemented  in  a  general  solver  called  EPISODE  [10,11],  along 
with  variable-step  Adams  methods  for  non-stiff  problems.  EPISODE  resembles  GEAR 
externally,  but  differs  internally  in  all  details  associated  with  integration  coefficients, 
error  estimation,  and  step  selection.  EPISODE  was  found  to  be  able  to  handle  the  diurnal 
prooiems  quite  reliably,  although  it  was  usually  somewhat  less  efficient  than  GEAR  on 
problems  with  smooth  solutions  [12].  A  banded  Jacobian  variant  EPISODEB  [13]  was  also 
written,  to  accommodate  diurnal  kinetics-transport  problems. 

After  most  of  the  model  building  for  these  atmospheric  problems  was  completed,  the 
question  of  alternative  space  discretizations  was  nevertheless  studied  to  some  extent.  N. 
Madsen  and  R.  Sincovec  showed  that  collocation  methods  could  be  used  quite  effectively  on 
1-D  problems  of  this  type,  and  developed  a  general  purpose  PDE  package,  called  PDECOL, 
from  that  idea  [14].  In  the  process,  it  was  clear  that  a  different  type  of  ODE  solver  was 
needed,  namely  one  which  would  treat  linearly  implicit  systems, 

Ay  =  glt,y)  (11) 

(A  a  square  matrix),  in  a  direct  and  efficient  manner.  To  this  end,  another  GEAR  variant, 
GEAR1B,  was  written  [15]  (lb  for  Implicit  systems,  Banded  matrix  treatment),  and  a 
modified  form  of  this  is  used  in  PDECOL.  Later,  an  analogous  EPISODE  variant, 
EPISODEIB,  was  also  written  [16].  These  two  ODE  solvers  are  intended  mainly  for  the 
case  of  a  nonsingular  A  matrix  in  (11),  which  is  the  most  common  situation  in  PDE-based 
problems,  but  can  also  be  used  in  the  singular  case,  if  used  with  caution. 


8 


3.  The  ODEPACK  Solvers 

The  D1FSUB,  GEAR*  ana  EPISODE  packages  were  added  to  a  list  of  available  general 
purpose  initial  value  solvers  that  was  growing  quite  sizable  by  1975.  The  length  and 
diversity  of  this  list  caused  some  concern  to  users  and  software  developers  alike.  There 
was  much  duplication  of  capabilities  offered,  but  at  the  same  time  there  was  very  little  in 
common  among  the  solvers  in  terms  of  either  their  external  appearance  or  their  internal 
structure.  This  situation  was  in  sharp  contrast  to  that  in  other  areas  in  which 
"systematized  collections"  of  Fortran  routines  were  being  developed.  The  earliest 
examples  were  E1SPACK  [171,  for  computing  matrix  eigensysterns,  UNPACK  118],  for 
solving  linear  systems,  and  FUN  PACK,  for  certain  special  functions. 

3.1  The  ODEPACK  Concept 

The  idea  of  a  systematized  collection  of  initial  value  ODE  solvers,  tentatively  called 
ODEPACK,  was  discussed  informally  as  early  as  1974,  in  workshops  attended  by  people 
from  all  over  the  world  [19).  However,  it  was  quickly  realized  that  the  task  was  much 
larger  in  the  ODE  case  than  in  other  areas,  partly  because  of  the  complexity  of  the 
subject,  ana  partly  because  of  widely  divergent  views  of  what  ODEPACK  should  look  like. 
Starting  in  197b,  attempts  were  made  to  reduce  the  problem  by  involving  only  people  at 
U.b.  Department  of  Energy  laboratories,  and  LLNL  received  funding  to  study  the 
feasibility  of  ODEPACK  from  the  Applied  Mathematical  Sciences  Research  Program  under 
the  Office  of  Basic  Energy  Sciences  in  DOE. 

The  natural  first  step,  and  a  necessary  preliminary  to  any  actual  development  of  an 
ODEPACK,  is  the  setting  of  standards  for  the  interface  between  the  user  and  the  ODE 
solvers.  The  user  interface  to  a  solver  consists  mainly  of  the  call  sequence  of  the  routine 
the  user  must  call,  together  with  definitions  of  the  one  or  more  user-supplied  routines 
called  by  the  solver.  To  the  extent  that  solvers  for  various  problem  types  and  using 
various  methods  must  all  communicate  certain  specific  things  to  and  from  the  user,  it  is 
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possible  to  formulate  a  loose  set  of  standards  for  the  user  interface.  An  early  proposal  is 
given  in  [20].  A  sequence  of  workshops  and  discussions  on  user  interface  standards  for  ODE 
solvers  succeeded  in  producing  a  reasonable  consensus  in  1978  [21,22].  This  tentative 
interface  standard  was  achieved  only  through  considerable  compromise  by  the  various 
participants,  which  included  ODE  software  authors  and  users  at  various  DOE  laboratories. 

At  that  time,  it  was  agreed  that  several  of  the  more  popular  ODE  solvers,  including 
GEAR,  GEARB,  DE/STEP  [23]  and  RKF45  [24],  would  be  rewritten  to  conform  with  the 
tentative  standard  interlace,  resulting  in  a  small  collection  that  was  at  least  systematized 
in  its  external  appearance.  The  first  result  of  that  agreement  was  a  package  based  on  the 
GEAR  and  GEARB  packages,  called  LSODE  (Livermore  Solver  for  ODE's)  [25].  The  LSODE 
solver  and  variants  of  it  written  subsequently  (all  in  accordance  with  the  tentative  standard 
interface  121],  with  minor  modifications)  are  briefly  described  in  the  following  subsections. 
In  the  meantime,  unfortunately,  the  other  software  authors  involved  withdrew  from  the 
agreement,  and  so  this  collection  does,  not  yet  have  analogous  rewritten  versions  of  their 
codes. 

3.2  LSODE 

LSODE  combines  the  capabilities  of  GEAR  and  GEARB.  Thus  it  solves  explicitly  given 
stiff  and  nonstiff  systems  y  =  f(t,y)  ,  and  in  the  stiff  case  it  treats  the  Jacobian  as  either 
full  or  banded,  and  as  either  user-supplied  or  internally  approximated  by  difference 
quotients.  By  comparison  with  GEAR  and  GEARB,  LSODE  offers  a  number  of  new  features 
that  make  it  more  convenient,  more  flexible,  more  portable  and  easier  to  install  in 
software  libraries.  Some  of  these  are  the  following: 

(a)  Through  the  redesigned  user  interface,  many  new  options  and  capabilities 
are  available,  and  others  are  much  more  convenient  than  before.  Some 
examples  are — more  flexible  error  tolerance  parameters,  independent 
flags  for  starting  and  stopping  options,  internally  computed  initial  step 
size,  two  work  arrays  in  the  call  sequence  for  all  internal  dynamic  work 
space,  user  names  for  f  and  J  in  the  call  sequence,  easy  changing  of 
input  parameters  in  mid-problem,  convenient  optional  inputs  (such  as 
maximum  method  order),  convenient  optional  outputs  (such  as  step  and 
function  evaluation  counts),  optional  provision  of  derivatives  of  the 
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solution  (oi  various  oraers)  at  any  point,  and  real  and  integer  user  data 
space  (of  dynamic  length)  available  in  the  f  and  J  routines  (with  no 
extra  burden  on  the  casual  user). 

(b)  The  user  documentation,  which  is  contained  in  the  initial  comment  curds 
of  the  source,  is  given  in  a  two-level  form.  A  short  and  simple  set  of 
instructions,  with  a  short  example  program,  is  given  first,  for  the  casual 
user.  Then  detailed  instructions  are  given  for  users  with  special  problem 
features  or  a  desire  for  nonstanaard  options.  The  latter  is  also  organized 
so  as  to  allow  selective  reading  by  a  user  who  wants  only  a  fraction  of  the 
nonstandard  capabilities. 

(c)  When  stiff  options  are  selected,  linear  systems  are  solved  with  routines 
from  LI^PACK  (18J,  which  is  becoming  a  widely  accepted  standard 
collection  of  linear  system  solvers. 

(o)  Some  retuning  of  various  heuristics  was  done  so  that  performance  should 
be  more  reliable  than  for  GEAR/GEARB. 

(e)  The  core  routine  which  takes  a  single  step,  called  STODE,  is  independent 
of  the  way  in  which  the  Jacobian  matrix  (if  used)  is  treated.  Thus  as 
variant  versions  of  LSODE  are  written  for  other  matrix  structures  (such 
as  LSODES),  these  will  share  the  same  subroutine  STODE. 

(f)  The  writing  of  all  error  messages  is  done  in  a  small  isolated 
general-purpose  message  handler  called  XERRWV.  Two  other  small 
subroutines  are  user-callable  for  optional  changing  of  the  output  unit 
number  and  optional  suppression  of  messages.  This  trio  of  routines  is 
compatible  with  a  much  larger  error  package  (the  SLATEC  Error 
Handling  Package)  written  elsewhere  [26J. 

(g)  LSODE  easily  allows  a  user  to  interrupt  a  problem  and  restart  it  later 
(e.g.  in  switching  between  two  or  more  ODE  problems).  Also,  using 
LSODE  in  overlay  mode  is  very  easy,  with  no  loss  of  needed  local  variables. 

(h)  The  various  lists  of  constants  needed  for  the  integration,  formerly 
appearing  in  a  subroutine  called  COSET,  are  now  computed  (once  per 
problem).  This  adds  to  the  portability  of  LSODE. 


3.3  LSOD1 

The  LSOD1  solver  125],  written  jointly  with  J.  F.  Painter  (LLNL), treats  systems  in  the 
linearly  implicit  form  A(t,y)y  =  g(t,y)  ,  where  a  is  a  square  matrix.  Many  problems, 
including  PDE’s  treated  by  finite  elements  and  the  like,  result  in  such  systems,  and  it  is 
almost  always  more  economical  to  treat  the  system  in  the  given  form  than  to  convert  it  to 
an  explicit  form  y  =  f  .  LSODI  allows  A  to  be  singular,  but  the  user  must  then  input 
consistent  initial  values  of  both  y  and  y  .  In  the  singular  case,  the  system  is  a 
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differential-algebraic  system,  ana  then  the  user  must  be  much  more  cautious  about 
formulating  a  well-posed  problem,  as  well  as  in  using  LSODI,  which  was  not  designea  to  be 
robust  in  this  case.  LSODI  is  based  on  (and  supersedes)  the  GEARIB  package,  but  corrects 
a  number  of  deficiencies,  as  follows: 

(a)  The  matrices  involved  can  be  treated  as  either  full  or  banded,  by  use  of 
the  method  flag. 

(b)  The  dependence  of  A  on  y  is  automatically  and  inexpensively  accounted 
for,  whether  partial  derivatives  are  supplied  by  the  user  or  computed 
internally  by  difference  quotients. 

(c)  When  A  is  singular,  the  user  needs  to  supply  only  the  initial  value  of 
ay/dt ,  and  this  array  (along  with  the  initial  y)  is  passed  through  the  call 
sequence,  rather  than  computed  in  a  user-replaceable  package  routine. 
(Admittedly,  correct  initial  data  can  be  difficult  to  obtain  for  some  types 
of  problems.)  When  the  initial  dy/dt  is  not  being  supplied,  an  input  flag 
instructs  LSODI  to  compute  it  on  the  assumption  that  A  is  initially 
nonsingular.  Thereafter,  no  such  assumption  is  made,  but  ill-conditioning 
can  be  a  problem  when  A  is  singular. 

(a)  The  user-supplied  residual  routine  includes  a  flag  which  allows  the  user  to 
signal  either  an  error  condition  or  an  interrupt  condition. 

(e)  To  the  maximum  extent  possible,  LSODI  shares  the  same  user  interface  as 
LSODE,  and  so  reflects  all  the  advantages  over  GEARIB  that  LSODE  has 
over  GEAR  and  GEARB. 

The  differences  between  the  LSODI  and  LSODE  user  interfaces  occur  primarily  in  the 
user-supplied  subroutines.  With  LSODI,  one  must  supply  a  routine  to  compute  the  residual 
function  r  =  g(t,y)  -  A(t,y)s  for  a  given  t  ,  y  ,  and  s  ,  and  another  routine  to  add  the 
matrix  A  to  a  given  array.  Optionally,  the  user  can  supply  a  routine  to  compute  the 
Jacobian  matrix  3r/3y  . 


By  virtue  of  the  modular  and  systematized  organization  of  LSODE  and  LSODI,  the  two 
packages  share  most  of  their  routines  with  each  other. 


Some  examples  of  the  use  of  LSODE  and  LSODI  on  systems  arising  from  PDE  problems 
can  be  found  in  127]  and  (28].  In  the  latter,  experiments  by  Painter  on  incompressible 
N  avier-Stokes  problems  shed  some  light  on  the  difficulties  involved  with 
differential-algebraic  systems. 
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3.4  LSODES 


The  LSODES  package  solves  explicit  systems  y  =  f  ,  Out  treats  the  Jacobian  matrix  J 
as  a  general  sparse  matrix  in  the  stiff  case.  LSODES  was  written  jointly  with  A.  H. 
Sherman  (Exxon  Production  Research  Co.),  and  supersedes  a  sparse  variant  of  GEAR 
called  GEARS  129, 30).  In  LSODES,  the  linear  systems  (6)  are  solved  using  parts  of  the  Yale 
Sparse  Matrix  Package  (YSMP)  [31,32].  This  involves  several  phases: 

(a)  Determination  of  sparsity  structure.  This  is  either  inferrea  from  calls  to 
the  f  routine,  inferred  front  cailsfto  a  J  routine  (if  one  is  supplied),  or 
supplied  directly  oy  the  user.  A  user  input  flag  determines  which  is  aone. 

(b)  Determination  of  pivot  order.  Diagonal  pivot  locations  are  chosen,  and  the 
choice  is  based  on  maximizing  sparsity.  This  is  done  by  YSMP. 

(c)  symbolic*  LU  factorization  of  the  matrix  P  .  This  is  based  only  on  sparsity 
and  the  pivot  order,  and  uses  the  moaule  in  YSMP  aesigned  for 
nonsymmetric  matrices  with  compressec  pointer  storage. 

(d>  Construction  of  j  .  This  can  be  done  internally  by  difference  quotients, 
or  with  a  user-supplied  routine,  in  the  difference  quotient  case,  the 
number  of  f  evaluations  needed  is  kept  to  a  minimum  by  a  column 
grouping  technique  due  to  Curtis,  Powell,  and  Reid  [33].  In  the  other  case, 
the  user-supplied  routine  provides  one  column  of  J  at  a  time,  in  the  form 
of  a  vector  of  length  N  (although  only  non-zero  elements  need  be 
computed  and  stored),  so  that  users  need  never  deal  with  the  internal  data 
structure  for  J  and  p  .  in  any  case,  J  is  stored  internally  in  an 
appropriate  packed  form.  Evaluations  of  J  are  done  only  occasionally,  as 
explained  below. 

(e)  Construction  of  P  =  I  -  hg0J  .  In  contrast  to  GEARS,  LSODES  does  not 
force  a  re-evaluation  of  J  whenever  the  existing  P  is  deemed  unsuitable 
for  the  corrector  iterations.  Instead,  when  the  value  of  J  contained  in 
the  stored  value  of  P  is  likely  to  be  usable  (and  P  is  not,  only  because 
h£0  has  changed  significantly),  then  a  new  matrix  P  is  constructed  from 
the  old  one,  with  careful  attention  to  roundoff  error.  This  cuts  down 
greatly  on  the  number  of  J  evaluations  necessary. 

(f)  Numerical  LU  factorization  of  p  .  This  is  done  by  YSMP  in  sparse  form, 
ana  the  array  containing  P  is  saved  in  the  process  (this  allows  for 
updating  P  as  described  above).  Because  of  the  absence  of  partial 
pivoting  for  numerical  stability,  this  operation  can  conceivably  fail. 
However,  this  has  only  rarely  been  observed  in  practice,  and  if  it  does 
occur  (with  a  current  value  of  J  ),  the  step  size  h  gets  reduced  and  the 
problem  aisappears. 

(g)  Solution  of  Px  =  r  .  This  is  done  by  YSMP  using  the  existing  sparse 
factorization  of  P  .  Because  a  modified  Newton  iteration  is  used,  many 
values  of  r  (i.e.,  many  linear  systems)  can  arise  for  the  same  P  ,  and  the 
separation  of  the  various  phases  takes  advantage  of  that  fact. 


Tne  first  three  phases,  and  part  of  the  fourth  (column  grouping  for  difference 
quotients),  are  normally  done  only  at  the  start  of  the  problem.  However,  the  user  can 
specify  that  the  sparsity  structure  is  to  be  reueterinined  in  the  middle  of  the  problem,  and 
then  these  operations  are  repeated. 

T 

Actually,  the  matrix  operated  on  by  YSMP  is  A  =  P  ,  not  P  ,  because  P  is 

generated  in  column  oraer  while  YSMP  requires  the  matrix  to  be  described  and  stored  in 

row  order.  This  causes  no  difficulty,  however,  because  YSMP  includes  a  routine  for  solving 

T  T 

the  transpose  problem  x  A  =  r  (which  is  equivalent  to  Px  =  r  )  as  well  as  for  the  direct 
problem  Ax  =  b  . 

A  package  called  LSOD1S,  similar  to  LSODI  (for  the  Ay  •  g  problem)  but  using  YSMP 
for  general  sparse  treatment  of  matrices  as  in  LSODES,  is  in  the  process  of  being  written. 

3.5  LSODA 

LSODA  is  a  variant  ol  LSODE  of  yet  another  kind.  It  was  written  jointly  with 
L.  R.  Petzold  (Sanaia-Livermore),  and  switches  automatically  between  nonstiff  (Adams) 
ana  stiff  (bDF)  methods,  by  an  algorithm  aeveloped  by  Petzold  134].  (The  suffix  A  is  for 
Automatic.)  Thus  it  is  more  convenient  than  LSODE  for  users  who  would  rather  not  be 
bothered  with  the  issue  of  stiffness.  Also,  it  is  potentially  more  efficient  than  LSODE 
(when  used  with  a  fixed  method  option),  when  the  nature  of  the  problem  changes  between 
stiff  ana  nonstiff  in  the  course  of  the  solution.  In  place  of  the  method  flag  parameter  of 
LSODE,  the  user  of  LSODA  supplies  only  a  Jacobian  type  flag.  The  storage  space  supplied 
to  the  solver  can  be  either  static  (and  thus  allow  for  either  the  stiff  or  nonstiff  method),  or 
dynamic  (and  altered  each  time  there  is  a  method  switch,  to  an  amount  specified  by  the 
solver). 


3.6  LSODAR 

LSODAK  combines  the  capabilities  of  LSODA  with  a  rootfinder.  It  allows  one  to  find 
the  roots  of  a  set  of  functions  g.(t,y)  of  the  independent  and  dependent  variables  in  the 
ODE  system.  Thus,  for  example,  it  could  be  used  in  a  particle  tracking  problem  to 
determine  when  a  particle  path  reaches  any  of  the  walls  of  a  container.  LSODAR  was  also 
written  jointly  with  L.  K.  Petzold  based  on  an  algorithm  [35]  developed  by  K.  Hiebert  and 

L.  F.  shampine  (Sandia-Albuquerque).  The  user  must  supply,  in  addition  to  the  LSODA 

*?• 

inputs,  a  subroutine  that  computes  a  vector-valued  function  g(t,y)  =  (g^,  i=l,2,...,NG)  such 
that  a  root  of  any  of  the  NG  functions  gi  is  desired.  Of  course  there  may  be  several 
such  roots  in  a  given  interval,  and  LSODAR  returns  them  one  at  a  time,  in  the  order  in 
which  they  occur  along  the  solution,  with  an  integer  array  to  tell  the  user  which  g-  (if 
any)  were  found  to  have  a  root  on  a  given  return. 

4.  An  Example  Problem 

in  order  to  illustrate  the  various  solvers  (new  and  old)  described  above,  and  to 
demonstrate  their  relative  merits  on  a  realistic  problem,  we  consider  here  an  example 
problem.  The  problem  is  a  simple  atmospheric  model  [30]  with  two  chemical  species 
undergoing  diurnal  kinetics  and  transport  in  two  space  dimensions.  The  independent 
variables  in  the  PDE  system  are  horizontal  position  x  ,  altitute  z  (both  in  kilometers), 
and  time  t  (in  sec),  with  0  <_x  <.20,  30  ^z  <_50,  0  j^t  86400  (1  day).  The  dependent 
variables  are  c  (x,z,t)  =  the  concentration  of  the  oxygen  singlet  [O],  and  c  (x,z,t)  =  that 

O 

of  ozone  lOtj]  (both  in  moles/cm  ).  The  concentration  of  molecular  oxygen  [02 ]  is  assumed 
constant.  The  equations  of  the  model  are: 

c*  =  (Kv(z)<4^  +  *Vi4x  +  R^.c^t)  (i=l,2),  (12) 

where  R1  and  R2  represent  the  chemistry  ano  are  given  by 
R^(c^,c2,t)  =  -(ki+k2C2)cl  +  kjttjc2  +  k4(t)»7.4»1016 
R2(cl,c2,t)  =  (ki-k2c2)cl  -  k3(t)c2  . 


Subscripts  t ,  z  ,  ana  x  denote  partial  derivatives.  The  various  coefficients  are  as  follows: 
hvte)  *  1(T®  *  explz/5),  Kh  =  4*llT6  ,  k*  =  6.03,  k2  *  4.66*10“16, 


{exp  (-7. 
0 


,-7.60l/sin(nt/432U0)) 


exp  (-22.62/sin(nt/43200)) 


t  <  43200  ^ 
t  >.43200  j  * 

t  <  43200  ^ 
t  >_  43200  J 


The  initial  conditions  are 


*  K^)4]  t-(^) 2  *  K^)‘J 

(i  *  1,2)  , 

1  2 

and  both  c  ana  c  are  required  to  satisfy  homogeneous  Neumann  boundary  conditions 
along  ail  the  x  ana  z  boundaries. 

To  solve  the  system  (12)  numerically,  we  apply  the  method  of  lines  using  a  regular 
rectangular  mesh  with  constant  mesh  spacings 

ox  =  20/(M  -1)  ,  nz  =  20/(M_-l)  . 

X  it 

The  spatial  derivatives  are  approximated  by  standard  5-point  central  differences,  as  given 
by  (3)  in  each  direction.  The  boundary  conditions  are  similarly  replaced  by  difference 
relations.  The  resulting  ODE  system  y  =  f(t,y)  has  size  N  =  2M„M„  .  The  initial  value 
vector  yQ  is  taken  from  the  initial  condition  functions  given  above.  The  system  Jacobian 
J  is  sparse,  with  roughly  12M  M,  =  6N  nonzero  elements.  As  a  band  matrix,  with 
component  ordering  first  by  species,  then  by  x  ,  and  lastly  by  z  ,  it  has  a  half-bandwidth 
of  2ta  ,  and  thus  a  full  bandwidth  of  4M  +1 . 

X  X 
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As  a  nominal  case,  consider  the  choice  M  =  M„  =  10  .  As  to  accuracy,  a  crude  model 

of  this  type  calls  for  no  more  than  a  few  significant  figures.  To  he  conservative  in 

recognizing  that  tolerance  parameters  are  applied  to  local  errors,  which  can  accumulate 

into  global  error,  we  might  impose  a  local  relative  tolerance  of  10  .  We  must  also 

specify  a  positive  absolute  tolerance  on  the  values  of  c  *  because  it  decays  to  negligible 

-2 

values  at  night.  A  reasonable  absolute  tolerance  is  10  .  With  the  ODEPACK  solvers, 

specifying  such  a  mixed  relative/  absolute  error  control  is  trivial,  but  with  the  GEAR  and 
EPISODE  families,  a  slight  modification  to  the  driver  is  necessary. 

Of  the  various  solvers  mentioned,  five  are  suitable  for  this  particular 
problem— LSODE,  LSODA,  LSODES,  EP1SODEB,  ana  GEARB1  .  Recall  that  LSODES  uses  a 
general  sparse  treatment  of  the  Jacobian  matrix,  GEAKB1  uses  block-  SOR,  and  the  others 
(in  this  case)  treat  the  Jacobian  as  banded.  The  problem  was  set  up  for  each  of  these  five 
solvers  and  run  on  a  C DC-7 600  computer.  For  all  but  GEARBI,  both  the  user-supplied 
Jacobian  option  ana  the  internal  difference  quotient  Jacobian  option  were  tested.  (For 
GEARBI,  there  is  no  difference  quotient  option.)  The  results  of  the  various  runs  are  given 
in  Table  1.  Jhe  tabulated  quantities  are: 


R.T.  = 

CPU  run  time  in  sec 

NST  = 

number  of  steps 

NFE  = 

number  of  f  evaluations 

NJE  = 

number  of  j  evaluations 

NLU  = 

number  of  LU  decompositions 

W.S.  = 

total  size  of  work  space  arrays 

in  the  table,  the  notation  USJ  denotes  the  user-supplied  Jacobian  option,  and  DQJ  denotes 
the  internal  difference  quotient  Jacobian  option. 


Table  1 

Results  of  Kinetics-Transport  Test  Problem 


Solver 

R.T. 

NST 

■ 

LSODE 

(USJ) 

23.2 

344 

LSODE 

(DQJ) 

28.4 

337 

LSODA 

(USJ) 

21.3 

338 

LSODA 

(DQJ) 

24.6 

339 

LSODES 

(USJ) 

13.1 

364 

LSODES 

(DQJ) 

13.5 

368 

EP1SODEB 

(USJ) 

18.6 

264 

EP1SODEB 

(DQJ) 

25.1 

264 

G  E  ARBI 
(10x10) 

6.3 

316 

GEARBI 

(38x38) 

180 

383 

NFE 

NJE 

NLU 

W.S. 

518 

68 

68 

14,242 

3338 

69 

69 

14,242 

584 

55 

55 

14,242 

2785 

55 

55  ’ 

14,242 

528 

10 

70 

12,455 

602 

8 

• 

72 

12,664 

461 

81 

81 

14,400 

3782 

81 

81 

14,400 

526 

50 

50 

3,004 

698 

96 

96 

43,324 

For  the  sake  of  illustration,  the  GEAKBI  test  was  repeated  on  a  38x38  grid,  and  the 
results  given  in  the  last  line  of  Table  1.  This  is  the  largest  square  grid  that  could  be 
accommodated  with  that  solver  on  the  CDC-760Q  within  its  Small  Core  Memory  (about 
57,0U0  woras).  as  noted,  the  Large  Core  Memory  (about  400,000  words)  was  used  for  the 
larger  actual  atmospheric  models.  The  Cray-1  computer  will  accept  even  larger  problem 
sizes. 

Several  points  of  interest  can  be  noted  in  the  table,  for  the  10x10  problem.  First,  the 
number  of  steps  does  not  vary  greatly  from  solver  to  solver,  because  that  is  determined 
almost  entirely  by  the  accuracy  requirement,  and  the  accuracy  is  much  the  same  for  all 
these  runs.  The  relative  merits  of  the  solvers  must  be  judged  from  other  statistics. 

The  performance  characteristics  of  LSODE,  LSODA,  and  EP1SODEB  are  similar,  as 
expected,  since  they  all  use  a  banded  Jacobian.  The  variations  are  partly  attributable  to 
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differences  in  the  fine  tuning,  but  the  lact  that  EP1S0DEB  the  smallest  NST  and  (with 
the  USJ  opticn)  the  shortest  run  time  can  be  attributed  to  its  use  of  variable-step 
formulas.  (Recall  that  the  diurnal  kinetics  problems  motivated  the  development  of 
EPISODE.)  The  use  of  a  difference  quotient  Jacobian  is  invariably  more  expensive  here, 
owing  to  its  cost  of  41  evaluations  of  f  for  each  evaluation  of  J  . 

The  LSODES  results  show  that  a  general  sparse  matrix  treatment  gives  a  significant 
speedup  over  the  band  treatment.  This  results  partly  from  the  matrix  software  itself,  and 
partly  from  the  algorithm  of  effectively  saving  old  values  of  J  for  greater  reuse.  Note 
that  each  computed  value  of  J  is  used  for  36  to  46  steps,  as  opposeo  to  only  3  to  6  steps 
with  the  solvers  using  a  banded  Jacobian.  Also,  the  cost  penalty  for  a  difference  quotient 
Jacobian  is  much  smaller  with  LSODES,  because  each  J  evaluation  here  costs  only  8  f 
evaluations.  The  storage  requirement  is  only  slightly  smaller,  reflecting  the  need  for 
sparsity  information  arrays  and  the  fact  the  Newton  matrix  P  is  not  overwritten  with  its 
LU  decomposition,  as  it  is  in  the  band  case. 

The  best  performance  on  this  problem,  however,  is  that  of  GEARBI.  This  should  not 
be  a  surprise,  since  the  Jacobian  has  a  very  regular  block  structure  of  which  the 
block-SOR  method  in  GEARBI  is  taking  full  advantage,  both  in  storage  and  computation. 
The  LU  decompositions  here  are  only  those  of  the  block  diagonal  part  of  the  Newton 
matrix  (with  2x2  blocks).  The  total  number  of  block-SOR  iterations  for  the  10x10  grid 
was  607,  or  an  average  of  less  than  2  per  step.  For  the  38x38  grid  this  cost  rose  to  2122 
iterations,  or  an  average  of  5.4  per  step.  The  latter  run  also  shows  that  spatial 
discretization  errors  in  the  10x10  grid  answers  are  as  large  as  2%  .  For  an  earlier 
comparison  test  on  this  problem,  see  £30j. 

For  large  stiff  systems  with  wioe-bano width  coupling,  of  which  this  is  an  example,  the 
single  most  important  criterion  for  selecting  a  method  or  solver  usually  turns  out  to  be  the 
storage  requirement.  For  this  problem,  on  a  m  x  m  grid,  the  storage  for  a  band-oriented 
solver  is  roughly  22m2  +  12m'1  ,  while  that  for  GEARBI  is  30m2  .  The  storage  for 

O 

LSODES  is  harder  to  predict,  but  behaves  very  roughly  like  150m  in  the  range  m  =  15  to 


25  .  Thus  LSODES  has  an  increasing  storage  advantage  over  LSODE  etc.  for  in  MO  ,  but 

GEARB1  has  a  lower  storage  requirement  than  any  of  them.  However,  if  the  problem  fails 

to  have  the  regularity  needed  for  such  an  approach,  or  if  block-SOR  is  not  appropriate  for 

numerical  reasons,  then  the  choices  seem  to  be  reduced  to 

(a)  solvers  using  a  general  sparse  direct  linear  system  solver,  such  as  LSODES, 

(o)  solvers  using  more  suitable  (possioly  aa  hoc)  iterative  methods  (and  which 
economize  on  matrix  storage  in  some  way),  and 

(c)  radically  different  ad  hoc  treatments  such  as  operator  splitting,  or 
methoas  that  combine  full  implicitness  and  splitting  ideas. 
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ABSTRACT 


The  paper  describes  and  evaluates  DSTIFF,  a  set  of  subroutines  for  solving  stiff 
ordinary  differential  equations.  The  code  is  somewhat  similar  to  the  well-known 
packages  LSODE,  GEAR  and  DIFSUB  but  the  present  set  of  subroutines  are  based  on 
least  squares  multi  step  formulas  rather  than  BDF.  The  paper  describes  the 
formulas  used  In  the  code,  the  structure  of  the  code  and  the  heuristics  used,  and 
evaluates  its  performance.  The  code  seems  to  be  much  more  efficient  than  LSODE 
In  solving  stiff  equations  which  have  jacobians  with  eigenvalues  having  large 
Imaginary  parts. 

On  other  problems,  DSTIFF  is  as  efficient  as  LSODE  on  larger  tolerances  and  some¬ 
what  less  efficient  than  LSODE  on  stringent  tolerances. 


1. 


INTRODUCTION 


Several  codes  are  now  available  for  solving  stiff  ordinary  differential  equations. 
Enright,  Hull  and  Lindberg  (1975)  and  Enright  and  Hull  (1976)  have  tested  codes 
for  solving  stiff  equations  and  Shampine  and  Gear  (1979)  discuss  some  of  the 
available  codes.  Shampine  and  Gear  also  discuss  why  there  is  a  need  to  distinguish 
a  special  class  of  problems  termed  stiff  and  describe  the  common  characteristics  of 
methods  used  for  solving  stiff  equations. 

Most  of  the  codes  available  for  solving  stiff  equations  may  be  divided  into  the 
following  three  classes: 

(a)  BDF  codes  -  The  first  code  using  the  Backward  Differentiation  Formulas  (BDF) 
was  designed  by  Gear  (1971).  This  code,  DIFSUB,  uses  formulas  up  to  order  6. 
Several  attempts  have  been  made  to  improve  this  code  and  several  variants  of 
the  code  are  now  available.  The  best  known  variants  are  GEAR  by  Hindmarsh 
(1974)  and  EPISODE  by  Byrne  and  Hindmarsh  (1975).  GEAR  and  EPISODE  themselves 
have  several  versions  available.  For  example  a  version  is  available  for 
solving  problems  with  banded  jacobians  and  another  for  linearly  implicit 
equations  of  the  form  Ay'  s  g,  A  a  square  matrix.  A  new  version  of  GEAR 
called  LSODE  is  now  available  from  Hindmarsh  (1980).  Several  other  DIFSUB 
variants  are  known  to  exist. 

Test  results  for  codes  cited  above  indicate  that  the  BDF  codes  are  the  most 
efficient  for  solving  stiff  equations  except  when  the  eigenvalues  of  the 
jacobian  of  the  differential  equations  have  large  imaginary  parts.  For  such 
oscillatory  problems,  BDF  codes  become  very  inefficient  because  of  the  poor 
stability  properties  of  the  5th  and  6th  order  BDF.  To  partly  overcome  this 
problem,  several  BDF  codes  restrict  the  maximum  order  of  the  formulas  used 
to  5. 

(b)  Second-Derivative  Codes  -  To  overcome  the  stability  problems  of  BDF, 

Enright  (1972)  suggested  the  use  of  second-derivative  formulas  (SDF);  linear 
multistep  formulas  which  include  second-derivative  terms.  Enright  designed 
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the  first  SDF  code,  SOBASIC,  using  formulas  of  orders  up  to  9.  Other 

SDF  codes  have  been  designed  by  Addison  (1979)  and  Sacks-Davis  (1980). 

Although  the  SDF  used  in  the  above  codes  have  much  better  stability  than 

BDF  and  therefore  overcome  the  problem  of  BDF  codes,  they  have  three 

serious  disadvantages.  These  are:  requirement  of  analytic  jacobians, 

necessity  of  more  linear  algebra  at  each  step  than  in  BDF  codes,  and 

difficulty  in  solving  some  problems  with  large  coupling  between  the 

equations.  The  last  problem  arises  because  the  corrector  iterations  used 

2  2 

in  the  SDF  codes  assume  that  a  f/Zy  terms  for  the  given  differential 
equations  are  negligible.  Testing  of  SDF  codes  indicates  that  they  are 
not  really  very  efficient  when  compared  to  BDF  codes  like  GEAR  except  for 
solving  small  systems  at  stringent  tolerances  or  for  solving  problems  with 
jacobians  having  large  imaginary  parts  for  which  BDF  codes  become  very 
inefficient.  As  noted  above,  the  biggest  problem  in  using  SDF  codes  is 
that  the  analytic  jacobian  must  be  provided.  However,  if  the  jacobian  is 
available,  SDF  codes  can  also  solve  non-stiff  equations  efficiently  because 
thr  truncation  error  coefficients  of  the  SDF  of  Enright  (1972)  are  quite 
small . 

(c)  Runge-Kutta  Codes  -  Implicit  Runge-Kutta  formulas  (RKF)  suitable  for 

solving  stiff  equations  have  been  derived  and  some  codes  based  on  implicit 
RKF  were  tested  by  Enright  et  al  (1975)  and  Enright  and  Hull  (1976).  The 
amount  of  linear  algebra  required  at  each  step  in  using  a  fully  implicit 
RKF  is  so  large,  that  these  formulas  can  not  be  expected  to  be  efficient 
even  for  solving  small  systems  of  equations.  Progress  has  recently  been 
made  in  reducing  the  amount  of  linear  algebra  involved  in  using  RKF  and  a 
code,  STRIDE,  has  been  designed  by  Butcher,  Burrage  and  Chipman  (1979)  based 
on  singly-impl icit  RKF  of  Burrage  (1978).  The  formulas  used  in  STRIDE  are 
A(a)-stable,  a  5  83°  (Widlund,  1967).  To  solve  a  system  of  N  equations, 
STRIDE  needs  to  do  about  the  same  amound  of  0(iv  )  operations  per  step  as 


3. 


<§ 


a  code  based  on  BDF  but  several  additional  O(ff^)  operations  per  step 
must  be  performed.  Kaps  and  Rentrop  (1979)  take  a  different  approach  in 
designing  two  codes  GRK4T  and  GRK4A  based  on  embedded  generalized  RKF 
(also  called  the  Rosenbrock  methods)  of  order  4.  The  formula  used  in 
GRK4A  is  A-stable  while  that  in  GRK4T  is  A  (89. 3®) -stable.  These  codes 
evaluate  the  jacobian  and  then  solve  a  linear  system  of  order  a  with 
four  right  hand  sides  at  each  step.  Preliminary  testing  by  Addison  (1980) 
and  results  presented  by  Kaps  and  Rentrop  (1979)  indicate  that  the  codes 
are  reliable  and  efficient  in  solving  problems  whose  jacobians  have 
eigenvalues  near  the  imaginary  axis.  We  therefore  expect  these  codes  to  be 
useful  in  solving  small  systems  of  stiff  equations  but  the  additional 
linear  algebra  involved  at  each  step  would  make  them  inefficient  for  large 
systems. 


In  addition  to  the  above,  a  code  based  on  cyclic  formulas  of  orders  upto  7, 
STINT,  has  been  designed  by  Tendler,  Bickard  and  Picel  (1978).  Another  code, 
TRAPEX,  based  on  extrapolation  was  tested  by  Enright  et  al  (1975)  and  Enright 
and  Hull  (1976).  Also  Skeel  and  Kong  (1977)  have  shown  that  blended  formulas 
using  Adams  formulas  and  BDF  together  are  suitable  for  solving  stiff 
equations.  The  numerical  testing  has  shown  that  TRAPEX  is  not  an  efficient 
code.  Tendler,  Bickart  and  Picel  (1978)  show  that  STINT  is  reliable  but  not  as 
efficient  as  GEAR.  Only  a  few  test  results  of  a  blended  formulas  code  are 
available.  The  blended  formula  approach  does  overcome  one  of  the  problems  of 
the  SDF  codes  since  it  does  not  require  analytic  jacobians.  Blended  formulas 
still  have  the  other  two  disadvantages  of  the  SDF  codes  regarding  additional 
linear  algebra  and  assumption  of  small  z2f/*y2  in  corrector  iterations. 

The  present  paper  describes  a  code  based  on  linear  multistep  formulas  derived 
using  least  squares.  The  code,  DSTIFF,  overcomes  the  weakness  of  BDF  codes  in 
solving  problems  which  have  highly  oscillatory  solution  components  and  is  almost 
as  efficient  and  reliable  as  the  BDF  'Odes  on  most  other  problems. 
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2.  FORMULAS  USED 

We  have  noted  that  the  BDF  codes  like  GEAR  and  LSODE  are  very  efficient  for 
solving  many  stiff  problems  but  have  trouble  on  systems  which  have  jacobian  with 
eigenvalues  close  to  the  imaginary  axis.  This  difficulty  arises  because  of  the 
poor  stability  of  BDF  of  orders  5  and  6  near  the  imaginary  axis.  To  overcome 
this  Wallace  and  Gupta  (1973),  Gupta  and  Wallace  (1975)  and  Gupta  (1975,  1976) 
have  derived  linear  formulas  having  much  better  stability  than  BDF.  In  fact  some 
formulas  presented  in  Gupta  (1975,  1976)  are  very  close  to  being  A-stable  even  at 
orders  6  and  7.  These  formulas  with  good  stability  however  are  not  very 
efficient  for  solving  stiff  equations  because  the  formulas  have  large  truncation 
error  coefficients  and  tend  to  have  roots  close  to  unity  at  hx  s  ».  This  is  in 
contrast  to  BDF  which  for  orders  5  and  6  have  poor  stability  near  the  imaginary 
axis,  small  truncation  error  coefficients  and  zero  roots  at  hx  *  ®  .  There  are, 
of  course,  a  large  number  of  formulas  somewhere  between  the  two  extremes  and  we 
have  derived  several  sets  in  Gupta  (1975).  We  have  chosen  a  set  based  on  least 
squares  to  be  included  in  DSTIFF.  This  set  is  similar  to  Adams-Moulton  formulas 
(AMF)  but  instead  of  iterpolation  we  use  least  squares  approximation.  To 
clarify  this,  we  first  define  the  predictor  corrector  method  as 

Pnix)  •  Pn_-,(x)  +  C{(x-xn)/h) 

where  we  want  to  compute  a  polynomial  pn(x)  degree  m  at  xn  so  that 
yn  *  P„(x„)  may  be  computed.  ?„_•](*)  is  the  polynomial  approximation  at  xn_1 
and  c  is  a  fixed  polynomial  of  degree  m  characteristic  of  the  particular 
multi  step  formula.  For  AMF  of  order  m  ,  C  is  such  that 

C(-l)  ~  0,  C,(“^)  ~  0 ,  k  s  1,2,  ....  m-1 . 

For  the  formulas  we  use  for  solving  stiff  equations,  we  require  that  C  of  order 
m  satisfy  the  condition  c(-l)  *  0  and  C’  be  a  least  squares  approximation  to 
points  (0,1),  (-1  0),  (-2,0)  ...  (-A\0)  where  N  >,  m-1  and  is  so  chosen  that  the 
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formula  is  stiffly  stable  and  has  small  truncation  error.  These  formulas  were 
labelled  FLS  in  Gupta  and  Wallace  (1975)  and  we  retain  this  name  for  them  although 
@  the  values  of  N  used  for  some  formulas  in  the  present  set  is  different  than  those 

in  that  paper. 


E 


m 


t 


In  Table  1,  we  present  some  of  the  characteristics  of  the  formulas  up  to  order  10. 
Higher  order  formulas  have  been  derived  but  their  inclusion  in  the  code  did  not 
seem  to  affect  the  subroutine's  performance  very  much.  We  will  show  in  the  last 
section  that  it  may  be  desirable  to  further  restrict  the  maximum  order  used  to  7. 

In  Table  1,  a  is  the  angle  defined  by  A(a)  -  stability  (Widlund,  1967).  D  is  the 
most  negative  Re(h\)  value  on  the  stability  curve,  is  the  truncation 

error  coefficient,  and  N  is  the  parameter  used  in  the  least  squares  approximation 
above. 


Order 

a 

D 

V 1 

Modulus  of  Largest  Root 
at  h\  *  » 

Value  of 

N 

v 

« 

i 

1 

90.00 

0.0 

0.500 

0.0 

* 

2 

90.00 

0.0 

0.083 

1.0 

1 

« 

* 

3 

86.46 

-0.075 

0.242 

0.32 

3 

4 

80.13 

-0.282 

0.374 

0.43 

5 

• 

5 

73.58 

-0.606 

0.529 

0.567 

7 

* 

6 

67.77 

-1.218 

0.724 

0.878 

9 

* 

7 

65.53 

-1.376 

1.886 

0.898 

12 

. 

8 

64.96 

-1.149 

7.686 

0.790 

16 

? 

9 

62.78 

-2.086 

16.737 

0.989 

19 

10 

63.74 

-1.223 

133.955 

0.878 

26 

■ 

1 


Table  1:  Truncation  Error  Coefficients  and 
Stability  of  Formulas  FLS 
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3.  ORDER  AND  STEP-SIZE  CHANGING 

It  is  assumed  that  the  reader  is  familiar  with  the  Nordsieck  representation  of 

the  predictor-corrector  methods.  At  any  point  xn  ,  let  an  be  the  vector  of 

scaled  derivatives  of  the  approximating  polynomial  pn(*)  when  a  step-size  of 

h  is  being  used.  We  have  (for  example,  refer  to  Gear  (1971,  p  216)) 
n 

a  =  Aa  +  Iw 
n+1  n 

where  A  is  a  pascal  triangle,  l  is  a  corrector  vector  depending  on  the  multi- 
step  formula  being  used  and  w  is  the  correction  necessary. 

In  DIFSUB,  Gear  (1971,  Section  9.3)  uses  the  following  techniques  in  step-size 
and  order  changing.  Let  the  present  order  be  q  and  the  present  step-size  be 
h 

n 

i)  a  has  q+1  elements.  The  last  element  a  a  ffl 

n  ^  n,<7+l  n  ^ 

Therefore  the  local  truncation  error,  which  is  equal  to 

kq+1  +  0 (*Sf2).  can  be  estimated  by  .  q'.  %f<?+r 

(  va  ,  being  the  correction  applied  to  the  last  element). 

n,  q + 1 

Similarly,  estimate  of  the  local  truncation  error  for  order  q-1  is  given 
by 

k  .  a  .  q\ 

q  n,q+l  ^ 

and  at  order  q+1  by 

V 1  •  q'  •  (?2  Vq+l* 

Let  us  call  these  three  estimates  E ^  Eq_]  an{*  Vi  respectively. 
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( 1  i )  Let  h  *  a ,h  where  k  =  a- 1,  a,  q+l .  New  step-sizes  for  oders  q,  q- 1 
n+ '  k  n 

and  a+1  are  now  calculated.  If  the  user  specified  error  tolerance  is  e. 


the  ratios,  c^, 
aR 


where  for  each  member  of  the  system  there  is  a  weight  w  and  ||.||  is  the 
Ig-norm.  The  order  to  be  used  on  the  next  step  is  selected  corresponding  to 
the  largest  of  these  <*'$. 

(iii)  The  following  heuristics  are  also  used. 

(a)  If  the  largest  a  value  is  less  than  1.1,  the  step-size  is  not 
increased. 

(b)  Step-size  is  not  changed  for  q+1  steps  after  the  last  change 
except  if  a  step  fails. 

(c)  Order  can  not  be  increased  if  a  step  fails. 

(d)  If  the  step-size  is  not  increased  after  computing  a's,  the 
step-size  and  the  order  is  not  changed  for  10  steps  except  if 
a  step  fails. 

(e)  If  a  step  fails  throe  times,  the  order  is  set  to  one. 

(f)  If  the  corrector  iterations  do  not  converge  then  the  step- 
size  is  reduced  by  a  factor  of  4. 


are  computed  as  above  but 


In  the  main  integrator  of  DSTIFF  E  ,  E  ,  and  E  .. 

q'  g-1  <7+1 

the  a's  are  computed  as  follows: 


i)  If  e  >  £  ,  that  is,  after  a  successful  step  we  have 


V  OF 


E 
JL 
t  w 


1/cT+l 


V 1 


1 

1.05 


lad 


u 


Vi 


i 

1.05 


1  V<y+2 


lad 


VI 


where  e.  *  e/Jc  ,fc  ■  q-1,  qt  q+1. 


Instead  of  the  z^-norm,  we  use  the  .t^-norm.  The  order  is  then  selected 
corresponding  to  the  largest  a  .  The  main  reason  for  changing  the 
heuristics  Is  that  we  want  to  encourage  the  code  to  go  to  as  high  an 
order  as  possible.  In  BDF  codes,  order  5  or  6  is  reached  quickly 
because  the  truncation  error  coefficients  of  higher  order  formulas  are 
smaller.  For  the  FIS  formulas  used  In  DSTIFF,  higher  order  formulas 
have  larger  truncation  error  coefficients  as  shown  in  Table  1. 

ii)  If  e  >  E  ,  then  the  unsuccessful  step  must  be  repeated  and  the  new 
<7 

step-size  is  based  on 


This  change  Is  desirable  because  the  need  for  reducing  the  step-size 
indicates  that  the  polynomials  approximating  the  solutions  are  not 
very  good  approximations  at  the  present  point.  Therefore  the  step- 
size  needs  to  be  reduced  by  a  larger  factor  than  would  be  reduced  by 
formulas  in  (i)  above. 

(iii)  and  the  following  heuristics  are  used: 

(a)  If  the  largest  a  value  is  less  than  1.025  then  the  step-size  is 
not  increased. 

(b)  Step-size  is  not  changed  for  <?+l  steps  after  the  last  change 
except  if  a  step  fails. 

(c)  Order  cannot  be  increased  if  a  step  fails. 

(d)  If  a  step  fails  twice  and  the  order  being  used  is  more  than  3, 
the  order  is  reduced  by  one. 

(e)  If  a  step  fails  three  times,  then  the  order  is  set  to  one. 

(f)  If  the  corrector  iterations  do  not  converge  then  step-size  is 
reduced  by  a  factor  of  2  and  the  step-size  and  the  order  is  not 
changed  for  the  next  10  steps  (except  if  a  step  fails). 

(g)  DSTIFF,  GEAR  and  LSODE  use  fixed-step  interpolation  when  step- 
size  is  varied.  This  means  that  the  coefficients  of  formulas  do 
not  change  when  the  step-size  is  changed.  In  DSTIFF  also  we  use 
fixed-step  interpolation  most  of  the  time,  but  we  use  a  technique 
called  "average- step  interpolation"  discussed  by  Gupta  and  Wallace 
(1979)  whenever  the  step-size  is  reduced.  New  coefficients  of  formulas 
are  computed  for  (q+Z)/ 3  steps  after  a  step-size  reduction  takes  place. 

4.  CORRECTOR  ITERATIONS 

To  compute  the  correction  necessary  so  that  the  approximating  polynomial  at  the  new 
point  x^i  satisfies  the  differential  equation,  an  iterative  procedure  must  be 
used.  For  stiff  equations,  the  simple  Iterations  do  not  converge  and  the  modified 
Newton's  iterations  need  to  be  used.  Using  the  Nordsieck  notation,  the  iterations 
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may  be  expressed  as  follows  (Gear,  1971,  p  207) 


“VUa+l)  *  VH,(»0  '  *  ^0 


fln+l  (m)  mth  approximation  to  the  vector  <zn+i»  l  is  the  corrector 

vector  with  first  coefficient  £g,  J  is  the  jacobian  and  F  is  a  function  such 
that  for  a  vector  a  ,  F[a)  -  fc/Ug)  -  •  F(a)  =0  if  vector  a  satisfies  the 

differential  equation. 


As  common  with  other  codes,  DSTIFF  evaluates  J  only  when  necessary  and  the  matrix 
-I  +  hlQ  J  is  stored  as  its  LU-decomposition.  The  present  version  of  DSTIFF  does 
not  have  any  sparse  matrix  facilities  to  solve  large  problems  more  efficiently. 

An  important  part  of  corrector  iterations  technique  is  the  convergence  test  used 
in  terminating  the  Iterations.  In  DIFSU8,  the  iterations  are  terminated  if  the 
following  test  is  satisfied. 


u  is  a  weight  and  Lm  -  norm  is  used,  t  Is  the  user-specified  error  tolerance  and 
N  is  the  number  of  equations.  The  constant  ^g/2%+2)  is  empirical  and  used  to 
ensure  that  the  implicit  equations  are  solved  somewhat  more  accurately  than  c  . 


Hindmarsh  (1974)  uses  a  slightly  different  convergence  test  in  GEAR  in  an  attempt 
to  reduce  the  number  of  iterations.  The  test  is  based  on  the  assumption  that 
corrector  iterations  converge  linearly  and  the  rate  of  convergence  is  almost  constant. 

Let 

d  * 
m 


v 
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v  is  a  weight  depending  on  the  error  criterion  and  Z^-norm  is  used  in 
computing  dm .  a is  the  rate  of  convergence.  The  following  convergence  test 
is  used 

2  a  d  <  ei0 
m  m  '  2 ( q+2 ) 

2  cm  dm  is  an  estimate  of  the  correction  at  the  next  iteration.  The  rate  of 

convergence,  c  ,  is  not  available  at  the  first  iteration  of  each  new  step  and  the 
n 

last  value  of  from  the  previous  step  is  used.  DSTIFF  also  uses  this  test  of 

corrector  convergence.  In  LSODE,  the  convergence  test  has  been  modified 

further,  d  is  computed  as  above  but  the  RMS-norm  is  used  instead  of  the  L0- 
m  <- 

norm.  The  convergence  test  seems  to  be  the  following 

1,5  cm  (dm  q  *'  km+l  lm  ^  *  2 (q+2) 

where  k  .  is  the  truncation  error  coefficient  of  the  formula  beina  used  and 
m+ 1 

l  is  the  last  coefficient  of  the  formula  vector  l  .  In  addition,  if  the  rate 
m 

of  convergence  cm  becomes  greater  than  2  on  the  second  or  the  third  iteration, 
the  code  assumes  that  the  iterations  are  diverging. 


Recently  Shampine  (1979,  1980)  has  dicussed  the  problem  of  testing  for  corrector 
convergence  and  pointed  out  some  weaknesses  of  the  tests  discussed  above. 


5.  STRUCTURE  OF  DSTIFF 


DSTIFF  consists  of  eight  subroutines  and  the  user  provides  two  subroutines  and 
the  calling  program  as  discussed  in  Gupta  (1979).  Out  of  the  eight  subroutines 
in  DSTIFF,  five  subroutines  (SDRIVE,  DEC,  SOL,  PSET,  INTERP)  are  very  similar 
to  those  used  by  Hindmarsh  and  Byrne  in  GEAR  and  EPISODE.  The  remaining  three 
subroutines  (SSTIFF,  METHOD,  NEWHQ)  are  quite  different.  We  discuss  the 
subroutines  briefly. 

(1)  SDRIVE  is  the  driver  subroutine  for  the  main  integrator  SSTIFF.  SDRIVE 

checks  the  parameters  passed  to  it,  and  calls  SSTIFF  repeatedly 
till  the  integration  reaches  the  end-point  specified  by  the  user. 
The  calling  sts.ence  of  DSTIFF  is  the  same  as  EPISODE  and  GEAR. 

(2)  DEC  and  are  subroutines  for  solving  linear  equations. 

SOL 

(3)  SSTIFF  is  the  main  integrator.  It  takes  one  step  and  returns  control  to 

the  calling  routine.  On  the  first  call,  the  subroutine  sets  the 
order  to  one  and  computes  a  suitable  initial  step-size.  At  the 
end  of  one  step  of  integration,  the  subroutine  computes,  if 
necessary,  new  step-size  and  order  to  be  used  for  the  next  step. 

(4)  METHOD  specifies  the  coefficients  of  the  formulas  being  used  and  the 

truncation  error  coefficients  of  the  formulas.  The  three 
dimensional  array  PERTST  used  in  DIFSUB  and  GEAR  has  not  been 
used  so  that  the  subroutine  is  easier  to  understand. 

(5)  NEWHQ  is  called  by  SSTIFF  when  new  step-size  and  order  needs  to  be 


computed.  Separating  the  step-size  and  order  changing  techniques 
into  a  subroutine  makes  it  easier  to  understand  and  modify  the 


(6)  PSET  is  called  by  SSTIFF  to  process  the  jacobian.  PSET  numerically 

evaluates  the  jacobian  if  necessary  and  calls  DEC  to  obtain  a  LU 
decomposition  of  the  linear  equations. 

(7)  INTERP  is  an  interpolation  routine  used  when  the  user  wants  the  solution 

at  intermediate  points. 

The  overall  structure  of  DST1FF,  therefore,  looks  like  the  following: 


6.  PERFORMANCE  EVALUATION  OF  DSTIFF 

Enright,  Hull  and  Lindberg  (1975),  Enright  and  Hull  (1976),  and  Addison  (1981) 
have  tested  several  subroutines  for  solving  stiff  equations.  As  we  have  noted, 

8DF  codes  have  been  found  to  be  most  efficient.  We  therefore  compare  the 
performance  of  DSTIFF  with  LSODE,  the  most  recent  BDF  code.  Tables  2  and  6 
compare  the  performance  of  LSODE  and  DSTIFF  in  solving  the  test  problems  in  the 
testing  package  described  in  Enright  (1979).  The  testing  package  New  Detest 
differs  from  the  package  used  in  Enright  et  al  (1975)  in  that  the  code  being 
tested  does  not  need  to  be  modified  and  normalised  statistics  if  required  are 
provided.  New  Detest  includes  a  set  of  30  test  problems  consisting  of  25  test 
problems  in  five  classes  A,  B,  C,  D  and  E  used  in  Enright  et  al  (1975)  and  5 
additional  chemical  kinetics  problems  in  class  F.  We  have  presented  the  over¬ 
all  results  for  each  of  the  six  problem  classes.  (New  Detest  was  run  on  an  IBM 
3031/6  at  the  Asian  Institue  of  Technology,  Bangkok.)  Absolute  error  criterion 
was  used  in  the  testing.  Detailed  results  of  DSTIFF  are  presented  in  Gupta  (1982) 

We  note  that  LSODE  and  DSTIFF  are  similar  codes  in  many  ways.  Both  codes  use 
Nordsieck  representation  of  the  linear  multistep  formulas  and  both  carry  out  about 
the  same  amount  of  work  per  step.  Therefore  the  statistics  provided  by  New  Detest 
are  a  good  basis  for  comparing  the  two  codes. 


PROBLEM  CLASS 

• 

C 

D 

I 

E 

F 

OVERALL 

10**-2 

346 

1 

804 

378 

534 

2135 

wm 

DSTIFF 

10**-4 

957 

mi  i#m 

1715 

962 

1207 

2927 

1 0**-6 

1970 

fefifl 

2891 

1686 

2235 

4633 

wsM 

10**-2 

250 

3613 

528 

300 

381 

1198 

i 

5820  1 

LSODE 

10**-4 

535 

3672 

1028 

623 

712 

2086 

8656 

10**-6 

1007 

4622 

1711 

1267 

1375 

3634 

13616 

Table  2:  The  Total  Number  of  Function  Evaluations 
(Overall  results  of  each  Problem  Class) 
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PROBLEM  CLASS 

m 

D 

B 

B 

E 

Ll. 

OVERALL 

10**-2 

61 

72 

78 

60 

59 

159 

489 

DSTIFF 

1 0**-4 

88 

90 

104 

96 

82 

145 

605 

10**-6 

106 

98 

118 

130 

101 

217 

770 

10** -2 

59 

206 

86 

67 

75 

183 

676 

LSODE 

10**-4 

86 

248 

127 

112 

100 

229 

902 

10**-6 

119 

289 

160 

166 

144 

346 

1224 

Table  3:  The  Total  Number  of  Jacobian  Evaluations  and  LU 
Decompositions  (Overall  results  of  each  Problem  Class) 


PROBLEM  CLASS 

B 

— 

C 

B 

E 

F 

OVERALL 

10**-2 

HU] 

3.335 

2.560 

0.742 

1.130 

5.131 

14.640 

DSTIFF 

1 0**— 4 

rani 

10.434 

6.375 

2.099 

3.191 

9.123 

37.733 

10** -6 

>  s 

22.416 

12.376 

4.515 

6.993 

16.789 

77.207 

10**-2 

1.664 

18.288 

2.159 

1.261 

3.969 

28.271 

LSODE 

10**-4 

3.765 

21.490 

4.355 

1.9581 

2.499 

7.032 

41.100 

10**-6 

7.100 

26.282 

7.331 

5.049 

12.678 

62.609 

Table  4:  The  Total  Time  In  Seconds 

(Overall  results  of  each  Problem  Class) 


PROBLEM  CLASS 

A 

B 

C 

B 

E 

F 

OVERALL 

DSTIFF 

10**-2 

204 

484 

390 

173 

wm 

918 

2419 

620 

1176 

866 

467 

1349 

5085 

1189 

2139 

1526 

869 

1180 

2364 

9267 

LSODE 

1 0**-2 

187 

2804 

395 

184 

255 

803 

4628 

444 

3123 

805 

406 

542 

1473 

6793 

10**-6 

832 

3989 

1392 

922 

1100 

2767 

11002 

Table  5:  The  Total  Number  of  Steps  Taken 
(Overall  results  of  each  Problem  Class) 


Fraction  of  Steps  oh-whlch  Local 

Error  > 

Toler 

ance 

PROBLEM  CLASS 

j— ——— 

A 

B 

C 

D 

E 

F 

OVERALL 

— 

10**-2 

0.000 

0.004 

0.010 

0.006 

0.020 

0.024 

0.014 

DSTIFF 

10** -4 

0.000 

0.006 

0.016 

0.004 

0.013 

0.020 

0.011  1 

10**-6 

0.004 

0.008 

0.006 

0.000 

0.011 

0.019 

0.010 

10**-2 

0.059 

0.339 

0.096 

0.098 

0.141 

0.173 

0.258 

LSODE 

1 0**-4 

0.027 

0.443 

0.063 

0.059 

0.110 

0.145 

0.256 

10**-6 

0.053 

0.359 

0.093 

0.060 

0.043 

0.145 

0.192 

Table  6:  The  Local  Error  Deception 
(Overall ^results  of  each  Problem  Class) 

We  note  that  DSTIFF  is  very  efficient,  specially  at  larger  tolerances.  The  main 
reason  for  the  much  better  overall  performance  of  DSTIFF  Is  the  problem  B5  which 
is  solved  by  DSTIFF  very  efficiently. 


Wo  note  that  both  DSTIFF  and  LSODE  failed  to  solve  problem  F3  at  tolerances 
of  10"2  and  10'4,  and  problem  F5  at  10-2.  In  addition,  DSTIFF  failed  to 
successfully  Integrate  problem  E5  at  tolerances  of  10"2  and  10-4.  Problem  E5 
Is  a  semi-stable  chemical  kinetics  problem  which  can  become  unstable  If  a 
computed  solution  becomes  negative.  Shamplne  (1981)  and  Enright  and  Hull  (1976) 
discuss  the  difficulty  of  solving  such  problems. 

On  the  basis  of  the  above  testing  we  can  say  that: 

(1)  Code  DSTIFF  seems  to  be  more  reliable  than  LSODE  and  the  error  deception 
of  DSTIFF  Is  much  smaller  than  for  LSODE. 

(2)  The  number  of  Jacobian  evaluations  are  smaller  for  DSTIFF  than  for  LSODE 
because  LSODE  reevaluates  jacoblans  regularly  even  if  there  Is  no  need  for 
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(3)  DSTIFF  solved  the  problems  in  Class  B,  much  more  efficiently  than  LSOOE. 

This  Is  because  the  problem  B5  was  solved  by  DSTIFF  in  134,  358  and  615 
steps  at  the  three  tolerances  respectively  while  LSODE  took  2351,  2397  and 
2620  steps  respectively.  Problem  B5  has  eigenvalues  very  close  to  the 
imaginary  axis  (-10  t  1001). 

(4)  On  problems  other  than  those  which  have  large  Imaginary  eigenvalues,  LSODE 
is  usually  more  efficient  than  DSTIFF  but  is  usually  less  reliable 

in  keeping  the  local  error  below  the  user  specified  tolerance. 

(5)  Somewhat  surprisingly.  In  spite  of  the  higher  order  formulas  used  by  DSTIFF, 
LSODE  seems  to  be  the  more  efficient  code  at  stringent  tolerances.  Partly 

:  this  is  due  to  the  high  truncation  error  coefficients  and  roots  of  the 
order  of  0.9  at  infinity  of  the  higher  order  formulas  In  DSTIFF. 

*  • 

(6)  The  average  number  of  function  evaluations  per  step  Is  close  to  2.0  for 

DSTIFF  while  for  LSODE,  It  Is  only  about  1.25.  The  smaller  number  of 
function  evaluations  per  step  by  LSODE  seem  to  be  partly  responsible  for 
larger  local  error  deceptions.  Also  if  the  corrector  convergence  tests  of 
both  codes  were  to  be  changed  to  the  test  recomnended  by  Shamplne  (1979) 
to  Improve  reliability,  we  expect  b<*‘  .ides  to  take  about  two  function 
evaluations  per  step.  Therefore  the  probably  will  have  much  more 

Impact  on  the  performance  of  LSODE  than  on  the  performance  of  DSTIFF, 

(7)  The  CPU  time  taken  per  step  on  the  average  Is  close  to  0.006  seconds  for 

LSODE  but  for  DSTIFF,  the  times  are  0.0060,  0.0074  and  0.0083  seconds  for 

-2  -4  -6 

the  three  tolerances  10  ,  10  and  10  respectively.  Time  per  step  Is 
larger  for  DSTIFF  partly  because  of  the  larger  number  of  function 
evaluations  per  step  and,  at  stringent  tolerances,  due  to  the  higher  order 
formulas  used. 


Another  set  of  tests  were  run  to  evaluate  the  performance  of  DSTIFF  In  solving 
stiff  equations  when  analytic  jacoblan  Is  not  available  and  the  code  must 
approximate  the  jacoblan  by  finite  differencing.  Table  7  gives  the  overall 
results  for  DSTIFF  and  LSODE.  DSTIFF  failed  to  complete  the  Integration  for 
Problem  F4  at  TOL  *  10”®  and  therefore  results  for  the  tolerance  10”6  for  LSODE 
also  do  not  Include  the  cost  of  solving  F4. 


TOL 

TIME 

FCN 

CALLS 

MAT 

FACT 

• 

NO  OF  STEPS 

10”2 

14.884 

7244 

488 

2417 

DSTIFF 

10”4 

37.833 

12804 

605 

5085 

W* 

64.714 

16884 

648 

7522  (*) 

10"2 

27.797 

8506 

652 

4510 

LSODE 

10"4 

40.942 

12394 

882 

6723 

10-6 

54.887 

15700 

1028 

9007  (*) 

Table  7:  Overall  Results  with  Numerically  Approximated  Jacoblan 
(*  excluding  cost  of  solving  problem  F4) 


To  evaluate  the  effect  of  high  order  formulas  on  the  efficiency  of  DSTIFF, 


we  conducted  another  set  of  tests.  He  used  DSTIFF  with  the  maximum  order  of 
formulas  restricted  to  5  and  then  with  maximum  orders  of  7  and  10  to  solve  the 
New  Detest  stiff  problems.  The  results  obtained  are  summarised  In  Table  8. 


4  *. 
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For  better  comparison  of  results  we  have  excluded  class  F  problems  from  the 
suitmary  In  Table  8.  This  has  been  done  because  for  maximum  order  of  5  we  had 


three  failures  In  Class  F  while  maximum  order  7  and  10  had  only  two  failures. 

_2 

We  note  that  the  results  at  TOL  3  10  are  not  affected  very  much  by  setting 
maximum  order  to  as  low  as  5.  For  more  stringent  tolerances,  maximum  order  of 
7  seems  to  be  better  than  maximum  order  of  5  or  10  since  the  number  of  function 
evaluations  and  the  number  of  steps  for  maximum  order  7  are  the  lowest. 


m 

MAX  ORDER 

TOL 

TIME 

FCN 

CALLS 

MAT 

FACT 

NO.  OF 
STEPS 

*  ■. 

:<V 
>•  ’ 

•< . 

ft 

5 

10‘2 

10“4 

10‘6 

• 

OVERALL 

9.308 

24.257 

50.244 

83.809 

3210 

7042 

13769 

|  24021 

331 

473 

520 

1324 

1551 

3857 

7631 

13039 

CM 

1 

O 

9.208 

3081 

331 

1499 

to 

7 

10~4 

26.375 

7134 

448 

3694 

Pr 

io-6 

53.204 

12743 

565 

6841 

»“ 

OVERALL 

88.787 

22958 

1344 

12034 

m 

9.509 

3063 

330 

1501 

10 

28.610 

7303 

460 

3736 

60.418 

12939 

553 

6903 

98.537 

23305 

1343 

12140 

E 


Table  8:  Overall  results  when  maximum  order  In  DSTIFF  Is 
set  to  5,  7  and  10  (excluding  Class  F) 


7.  CONCLUDING  REMARKS 


The  code  DSTIFF  has  been  shown  to  be  a  reliable  and  efficient  code.  It  Is  much  more 
efficient  than  LSODE  In  solving  problems  which  have  jacoblans  with  eigenvalues 
having  large  Imaginary  parts.  On  other  problems,  DSTIFF  Is  almost  as  efficient 
as  LSODE  on  larger  tolerances  and  somewhat  less  efficient  than  LSODE  on  stringent 
tolerances. 

The  evaluation  of  DSTIFF  using  New  Detest  Is  only  one  Indicator  of  its  performance. 
For  example,  the  Jacoblans  In  New  Detest  are  so  cheap  to  evaluate  that  the  fact 
that  one  code  takes  smaller  numbers  of  Jacobian  evaluations  Is  not  really 
reflected  In  the  CPU  time  statistics  provided  by  New  Detest.  We  further  note  that 
If  a  code  falls  on  one  or  more  of  the  problems  in  New  Detest,  comparison  of 
codes  becomes  very  difficult.  Also  It  Is  difficult  to  say  how  serious  a  failure 
should  be  considered  when  some  of  the  problems  In  the  set  are  very  difficult 
problems.  Recently  Shamplne  (1981)  has  critically  looked  at  the  set  of  problems 
In  New  Detest  (except  Class  F)  and  It  Is  hoped  that  New  Detest  will  be  modified 
In  line  with  Shamplne 's  comments. 

We  hope  to  further  Improve  DSTIFF  and  are  studying  why  DSTIFF  doesn't  perform 
better  at  stringent  tolerances.  We  are  looking  at  the  order  chaning  algorithm 
and  are  considering  reducing  the  maximum  order  used  In  the  code. 
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O.  Introduction 


The  numerical  integration  of  "stiff  ordinary  differ¬ 
ential  equations  (ODE's)  is  one  of  the  active  fields 
of  research  in  numerical  analysis.  Among  the  possible 
applications  of  any  stiff  integration  method,  the  class 
of  problems  arising  from  maea  action  chemical  kinetics 
is  of  still  increasing  interest.  It  is  therefore  only 
natural  that  the  development  of  a  new  stiff  integrator 
on  an  extrapolation  basis  by  Bader/ Deuflhard  [1]  has 
also  led  to  the  development  of  a  software  package  de¬ 
signed  to  handle  large  chemical  kinetics  problems.  De¬ 
tails  about  this  package  LARKIN  have  been  presented  by 
Deuflhard/Bader /Nowak  [4]  in  the  proceedings  of  a  recent 
workshop  on  modelling  of  chemical  reaction  systems  (ed. 
by  Ebert/Deuf lhard/ JHger  [ 6 ] ) .  This  early  version  of 
LARKIN,  hcwever,  was  portable  only  on  IBM  computers.  In 
addition,  part  of  that  code  was  written  in  PL/1 ,  (where¬ 
as  most  of  it  was  written  in  FORTRAN) .  These  disadvant¬ 
ages  of  the  previous  code  were  certainly  prohibitive 
for  a  wider  distribution. 

It  is  the  purpose  of  the  present  article  to  report  on 
progress  made  in  the  development  of  the  package  LARKIN 
since  then.  The  present  code  which  is  exclusively  writ¬ 
ten  in  FORTRAN  IV,  should  be  portable  on  a  wide  range 
of  computers.  Moreover,  it  permits  a  user  to  attack 
much  larger  problems.  This  feature  has  been  essentially 
achieved  by  the  implementation  of  a  new  "chemical  comp¬ 
iler  system"  and  a  new  sparse  solver  (for  details  see 
Duff /Nowak  [5]). 

A  general  outline  of  the  structure  and  some  special  prop¬ 
erties  of  LARKIN  are  given  in  section  1 .  Input/output 
facilities  are  described  in  section  2.  Section  3  contains 


details  of  the  "chemical  compiler  system "  that  automat¬ 
ically  generates  the .ODE  right-hand  side  together  with 
the  associated  Jacobian  matrix  and  its  sparse  pattern 
from  the  input  in  terms  of  chemical  reaction  equations 
(and  rate  constants) .  A  short  presentation  of  the  se¬ 
lected  stiff  integrator  of  extrapolation  type  follows  in 
section  4,  where  also  a  rough  and  short  comparison  with 
the  competing  BDF-type  integrator  is  made. 

The  philosophy  behind  the  development  of  this  code  has 
been  to  allow  any  chemical  user  to  concentrate  on  the 
modelling  aapeota  of  his  chemical  reaction  system  under 
consideration.  In  view  of  this  aim,  the  package  has  been 
developed  in  direct  and  steady  contact  with  a  local  group 
of  physical  chemists  that  have  been  using  the  system  suc¬ 
cessfully  for  quite  a  while  (see  e.g.  [12]).  Therefore, 
the  software  package  LARKIN,  as  it  stands  now,  should  be 
a  useful  and  reliable  tool  also  in  other  chemical  research 
environments . 
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1 .  General  Description  of  the  Package 

In  developing  LAPKIN/  the  main  emphasis  has  been  put  onto 
the  implementation  of  an  interactive  dialogue  system.  In 
order  to  be  able  to  use  this  system,  a  user  is  only  re¬ 
quired  to  know  details  about  his  chemical  model  (and  cer¬ 
tainly  not  details  about  the  computer  system  or  about 
numerical  analysis!).  In  particular,  the  quest ion-answer 
dialogue  contains  only  either  self-explaining  or  chemical 
terms.  There  is  no  command  language  to  be  memorized  by 
the  user. 

The  package  LARKIN  as  a  whole  is  written  in  a  "reasonably 
portable"  FORTRAN.  The  FORTRAN  compiler  needs  to  be 
called  essentially  once,  when  the  load  module  is  pro¬ 
duced  from  the  source  program  (typical  service  of  a  com¬ 
puting  center) .  Further  calls  of  the  FORTRAN  compiler  are 
only  necessary,  if  changes  in  the  size  of  the  allocated 
storage  are  made.  All  model  changes  (say,  changes  of 
chemical  reactions ,  initial  values ,  rate  constants ,  . . . ) 
can  be  easily  and  quickly  performed  within  the  frame  of 
the  dialogue  system. 


Fig.  1 :  structural  diagram  of  LARKIN 


A  structural  diagram  of  the  package  LARKIN  is  represented  in 
Fig.  1 .  Details  concerning  the  individual  main  blocks  are  as 
follows : 

EDIT  -  input  dialogue  block:  accepts  chemical  input  and  trans¬ 
fers  it  to  the  database  CHEMIN;  the  contents  of  CHEMIN 
are  available  in  EDIT(  for  possible  modifications).  For 
more  details  see  section  2. 

COMPILER  -  "chemical  compiler"  block:  reads  contents  of  data¬ 
base  CHEMIN;  checks  syntax  of  chemical  input;  produces 
all  pertinent  information,  which  is  necessary  for  the 
numerical  simulation;  stores  this  information  in  binary 
database  INVAL.  For  more  details  see  section  3. 

SIMULA  -  numerical  integrator  block:  performs  numerical  sim¬ 
ulation  of  chemical  reaction  system  using  information 
from  database  INVAL.  In  the  course of  the  integration, 
data  of  chemical  interest  (usually  not  only  chemical 
species  concentrations)  are  updated  in  database  OUT. 

For  details  concerning  the  stiff  integrator  see  sec¬ 
tion  4. 

OUTPUT  7  output  dialogue  block:  gives  (possibly  selected) 
output  information  in  either  graphical  or  numerical 
form  as  requested  by  the  user;  requires  interface  to 
plot  software  of  each  individual  computing  center.  For 
more  details  see  section  2. 

The  whole  package  LARXIN,  as  it  stands  now,  applies  to  the 
numerical  solution  of  mass  action  kinetics  problems,  which 
involves  the  numerical  integration  of  large  stiff  ODE  systems 
with  polynomial  right-hand  sides.  The  standard  formulation  is 
in  terms  of  molar  concentrations  (changes  to  mass  or  mole 
fractions  are  possible,  compare  section  2) .  The  additional 


treatment  of  diffusion  or  the  handling  of  differential  •- 
algebraic  systems  is  presently  not  possible  (in  preparation) . 
Generally  speaking,  the  modularized  form  of  the  package  facil¬ 
itates  changes  of  individual  blocks. 

In  view  of  the  possible  solution  of  very  large  systems,  storage 
economy  has  had  a  high  priority  in  the  development  of  the  pack¬ 
age.  with  the  notation 

NSP:  number  of  species, 

NCEQ:  number  of  chemical  equations 

(reversible  reactions  count  as  2) , 

NZE:  number  of  non-zero  elements  in  Jacobian 
matrix, 

NLU:  maximum  number  of  non-zero  elements  in 
LU-decamposition , 

the  total  workspace  required  can  be  estimated  as  follows: 

integer  workspace: 

X:>  NLU  +  NZE  +  6  •  NCEQ  +  15  •  NSP 
real  workspace: 

R:«  NLU  +  NZE  +  3  •  NCEQ  ♦  15  •  NSP 

Upon  referring  to  the  rough  considerations  of  [4]  ,  the  fol¬ 
lowing  rules  of  thumb  are  obtained  (with  r _ :■  7,  «:■  2)  : 

max 

6  •  NCEQ  +  36  •  NSP 
R#gt-  3  •  NCEQ  +  36  •  NSP 

For  the  largest  example  (SNA- polymerization)  presented  in  this 
paper,  these  numbers  lead  to  34 K  from  X  t  (in  Integer  *2)  and 

60  K  from  R#lt  in  single  precision  or  1 20 K  from  R##t  in  double 

precision.  For  comparison,  the  actually  required  array  storage 


in  this  example  turned  out  to  be  32  K  from  I,  55  K  from  R 
in  single  precision ,  and  11  OK  in  double  precision.  These  num¬ 
bers  may  be  compared  with  about  100  K  for  the  object  code  of 
the  total  package.  Considerable  savings,  however,  can  be  made 
by  using  OVERLAY  techniques. 

As  a  consequence,  the  present  version  of  LARKIN  should  be 
helpful  in  simulating  comparatively  large  chemical  reaction 
systems  on  moderate  size  computers. 

2.  Input/Output  Facilities 

This  section  deals  with  details  of  the  two  dialogue  blocks 
EDIT  and  OUTPUT,  as  far  as  they  are  relevant  from  the  user's 
point  of  view.  A  complete  description  of  the  syntax  of  the 
chemical  input  would  be  beyond  the  scope  of  this  paper .  Rather 
part  of  the  documentation  of  problem  HEXAN  1  of  section  4  is 
inserted  for  illustration  purposes.  First,  the  block  EDIT  will 
be  described  in  a  sequential  order.  Note,  however,  that  the 
actual  user  input  may  be  given  in  arbitrary  order. 

Head  (optional) .  This  sub-block  may  contain  a  short  documenta¬ 
tion  of  the  problem  (up  to  5  lines) . 

ISBARN, EDERER, EBERT: 

THE  THERMAL  DECOMPOSITION  OF  N-HEXANE, 

KINETICS, MECHANISM,  AND  SIMULATION. 

(  DATE  OF  LATEST  CHANGE:  JAN.  21,  '82  ) 

Elements  (optional)  -  list  of  chemical  element  names,  each  of 
which  may  be  followed  by  its  atomic  weight.  The  names  are  used 
in  the  subsequent  stoichiometric  test.  The  atomic  weights  are 
necessary  for  a  possible  change  from  molar  concentrations  to 
mass  or  mole  fractions  in  the  output. 


H  1.008 
C  12.011 


Spaeiaa  (optional  -  list  of  chamical  species  names,  each  of 
which  may  be  followed  by  the  element  composition  (for 
stoichiometric  test) . 


a 

CH3 

3-M-C4H8 

E-BENZOL 

STYROL 


B  1 

Cl  B  3 

C  5  B  10 
C  8  H  10 
B  8  C  8 


The  order  of  appearance  of  the  names  defines  the  order  in 
the  output.  Further  species  names  appearing  only  in  the 
block  raaaticn  are  automatically  added.  However ,  it  is 
strongly  advised  to  use  the  optional  stoichiometric  test  for 
the  possible  detection  of  input  flaws. 

Raaetxon  -  complete  list  of  irreversible  or  reversible  chem¬ 
ical  reactions,  each  of  which  is  followed  by  the  associated 
kinetic  parameters  (or  (In  A,  AE)  for  Arrhenius  law). 


CB4 

•> 

B 

♦ 

CB3 

(15.3,104000) 

C2H6 

-> 

B  + 

C2B5 

(16.0, 

C2B6 

<->  CB3  +  CH3 

(16.6,  88D3)  (10 

C2B5 

+  1-M-AL 

->  3-M-C5H10 

(  9.0, 

0.0) 

C2B5 

♦  1-E-AL 

■> 

(  9  ,  0) 

02- 

♦  CS+ 

■> 

CS 

+  i 

02 

(0.5D-7) 

CS+ 

♦  E- 

■> 

CS 

(0 .  ID— 11) 

CS 

->  CS+ 

+ 

E- 

. (0.324D-2) 

02- 

•>  02 

4> 

E- 

(0.400) 

02 

+  CS 

4* 

N 

-> 

CS02 

*  M  (0. 

02 

*  02 

♦ 

B- 

■> 

02- 

4-  02  (0. 

02 

♦  E- 

♦ 

N2 

■  > 

02- 

4*  N2  (0 

As  can  be  seen  from  the  above  special  example,  the  equations 
may  be  written  in  a  chemical  standard  nomenclature.  (The  above 
insertions  have  deliberately  been  presented  in  a  rather  irreg¬ 
ular  shape  to  demonstrate  the  robustness  of  the  package.) 

Initial  concentrations  -  list  of  non-zero  initial  concentra¬ 
tions  in  the  form 

C6H14  0.0021 

Temperature  (optional,  fixed  value)  -  only  required,  if  either 
Arrhenius  of  modified  Arrhenius  law  is  used  for  the  rate  con¬ 
stants  ;  may  be  skipped ,  if  rate  constants  are  prescribed  as  numbers 

numerical  Input  -  list  of  additional  information,  which  is 
necessary  in  the  simulation,  such  as: required  relative  accuracy , 
user  prescribed  output  points  (including  initial  and  final 
point;  output  points,  which  are  automatically  selected  by  the 
integrator,  are  independently  stored  in  database  OUT  for  print 
or  plot  purposes),  and  print  option  parameter. 

The  above  sub-blocks  comprise  the  block  EDIT.  Every  (even  line- 
wise)  change  of  the  input  is  automatically  updated  in  database 
CHEMIN. 

All  output  information  is  available  from  database  OUT,  for  fur¬ 
ther  use  either  inside  or  outside  the  package  LARKIN.  The  dia¬ 
logue  block  OUTPUT  offers  possibly  selected  results  of  the 
simulation  either  in  the  form  of  numerical  data  or  in  graphic 
form.  The  numerical  data  in  database  OUT  contain  at  least  the 
concentrations  at  all  intermediate  points  (both  integrator 
selected  and  possibly  user  prescribed) ,  if  not  suppressed  by 
the  above  print  option;  from  the  experience  of  the  authors, 
additional*  numerical  information  like  reaction  rates,  inte¬ 
grated  reaction  rates,  pressure  etc.  are  helpful  in  the  pro¬ 
cess  of  modelling  and  are  therefore  available  from  OUT.  The 
graphic  output  contains  several  options  including  either  single 
or  double  logarithmic  scale  and  phase  diagrams.  Only  minor 


emphasis  has  been  put  into  the  development  of  the  plot  soft¬ 
ware  -  the  adaptation  of  the  graphic  part  of  the  package  any¬ 
way  requires  a  special  interface  to  the  local  plotting  fa¬ 
cilities  . 

In  view  of  any  adaptations,  the  database  OUT  has  been  espe¬ 
cially  well-structured  to  facilitate  the  access. 


3.  Chemical  Compiler  System 

The  above  input  in  terms  of  chemical  reactions,  rate  constants 
and  initial  concentrations  uniquely  defines  an  associated  ODE 
initial  value  problem 

(3.1)  y'  -  f (y)  ,  y(o)  -  y0 

with  time  t  as  independent  variable.  The  actual  generation  of 
this  ODE  system  is  usually  done  by  a  so-called  "chemical  com¬ 
piler"  or  "interpreter",  -  see  [2, 7, 13, 17].  There  is  the 
following  principal  alternative:  (a)  generation  of  explicit 
FORTRAN  subroutines  f  and  fy  (required  by  standard  stiff  in¬ 
tegrators)  with  the  consequence  of  a  necessary  FORTRAN  compi¬ 
lation  step  after  each  model  change,  (b)  symbolic  generation 
of  f  and  fy,  which  are  evaluated  by  means  of  formal  routines 
contained  in  the  chemical  kinetics  package. 

Advantages  of  (a)  are 

•  lower  evaluation  time  for  f  and 

•  slightly  more  transparent  program  structure 
Advantages  of  (b)  are 

•  model  changes  do  not  require  additional  calls  of 
the  FORTRAN  compiler 

•  smaller  storage  for  object  module 

This  implies  that  (b)  is  preferable  for  lavg «  problems  and  is 
certainly  more  convenient  from  the  user ' s  point  of  view  even 
for  smaller  problems.  In  the  standard  version  of  LARKIN,  ver- 


slon  (b)  is  realized;  version  (a)  can  be  produced  by  a  few 
minor  changes.  The  limitations  of  version  (a)  may  be  clearly 
seen  from  its  performance  in  the  following 

Example :  SNA  polymerization  (NCEQ  *  770,  NSP  *  352) 

(due  to  [16]) 

Version  (a) :  The  listing  of  the  source  code  of  f  and  fy  is 
about  70  printed  pages.  After  about  10  minutes  compilation 
time,  the  FORTRAN  compiler  gives  in  with  the  message:  "source 
program  too  large". 

Version  (b) :  A  listing  of  the  symbolic  representation  of  f 
and  fy  would  be  about  1  printed  page.  For  comparison,  note 
that  about  2  sec  are  needed  to  generate  f  and  fy  and  less 
than  1  min.  is  needed  to  simulate  the  whole  chemical  system 
(see  also  section  4). 

•  * 

The  symbolic  generation  of  f  uses  partly  ideas  that  have  been 
suggested  by  St abler /Che sick  [17]  in  their  realization  of  the 
package  HAVCHM.  Unlike  HAVCHM,  however,  but  similar  to  the 
package  CHEMKIN  due  to  Kee/Miller/Jef f erson  [13],  LARKIN  ac¬ 
cepts  a  familiar  chemical  language  formulation  with  only  few 
format  restrictions.  This  user-oriented  structure  of  the  input 
leads  to  the  necessary  implementation  of  a  scanning  procedure. 
All  species  names  appearing  in  the  optional  species  list  or  in 
the  chemical  reactions  are  identified  and  internally  listed  in 
an  integer  representation.  The  integer  position  in  this  list 
(called  NAME)  represents  the  internal  coding.  This  coding  is 
then  used  to  update  the  structure  of  the  reaction  scheme  line 
by  line  in  two  integer  vectors.  In  the  course  of  this  scanning 
process,  the  rate  constants  (following  each  reaction)  are  si¬ 
multaneously  read  and  analyzed.  The  evaluation  of  f  is  done 
in  a  standard  way  as  in  comparable  packages,  say  HAVCHM.  The 
build-up  of  the  list  NAME  needs  a  large  number  (-4*NCEQ)  of 
table  look-ups.  In  order  to  avoid  a  blow-up  of  this  look-up 
time  in  very  large  systems,  a  special  hath  technique  is  used 
(for  details  see  Knuth  [14]). 
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The  symbolic  generation  of  the  Jacobian  would  then  be 
comparatively  trivial,  if  a  full  mode  updating  scheme  were 
applied  (where  each  entry  of  f^  is  characterized  by  its 
row  and  column  index).  However,  in  view  of  large  systems,  a 
sparse  storage  scheme  seems  to  be  crucial,  which  means  that 
in  addition  to  the  non-zero  elements  of  fy  the  associated 
sparse  pattern  of  (I-hfy)  has  to  be  provided.  The  pattern 
representation  and  the  storage  scheme  in  LARKIN  are  on  one 
hand  rather  concise  and  on  the  other  hand  general  enough  to 
permit  the  use  of  any  of  the  presently  available  sparse  sol¬ 
vers.  For  more  details  see  [4].  This  side  condition  implies 
that  the  evaluation  of  is  technically  more  complicated 
than  in  packages  that  just  use  full  mode  representation. 

By  means  of  scanning  routines  similar  to  the  one  described  above 
further  information  from  database  CHEMXN  is  picked  up,  ana¬ 
lyzed,  and  transferred  to  the  binary  file  INVAL.  In  the 
course  of  the  compilation,  a  series  of  error  ahecke  are 
made,  e.g. 

•  syntax  monitor  for  each  line, 

•  check  on  input  inconsistencies, 

•  optional  stoichiometric  test. 

If  any  of  these  checks  is  not  passed,  self-explanatory  warning 
or  error  messages  are  issued. 

The  priorities  in  the  construction  and  implementation  of  the 
chemical  compiler  system  have  been  storage  .economy,  porta¬ 
bility,  and  ease  of  handling  from  the  user's  point  of  view. 
These  priorities  lead  to  a  lot  of  rather  technical  but  impor¬ 
tant  details,  which  are  not  suited  for  a  presentation  within 
the  scope  of  this  paper.  These  details  are  anyway  best  docu¬ 
mented  by  the  code  itself. 
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4.  Numerical  Simulation 

Tha  actual  numerical  simulation  (■  integration  of  the  ODE 
system)  is  performed  in  the  block  SIMULA,  which  uses  infor¬ 
mation  only  from  database  INVAL.  Part  of  the  information, 
which  is  not  used,  is  just  passed  over  to  database  OUT.  An¬ 
other  part  of  the  information  is  preprocessed  before  the 
call  of  the  stiff  integrator  (e.g.  computation  of  the  rate 
constants  by  virtue  of  the  Arrhenius  law) .  In  the  course  of 
the  numerical  integration,  the  intermediate  results  are  trans¬ 
ferred  to  database  OUT  by  an  extra  routine.  Once  more  note  f 
that  this  interface  structure  facilitates  a  possible  exchange 
of  sub-blocks  (such  as  the  integrator) . 

The  stiff  integrator  used  in  LARKIN  is  METASC,  a  special  ver¬ 
sion  of  MET  AN  1  (due  to  [ 1 ] )  adapted  to  large  chemical  kinet¬ 
ics  problems .  The  code  is  based  on  the  semi -implicit  mid-point 
rule  with  polynomial  extrapolation  -  for  details  see  [1,4] 

The  special  adaptation  of  METASC  refers  to 

•  a  scaling  technique,  which  uses  further  information 

’  from  mass  action  kinetics  (see  [41,  p.  45  -  48), 

•  the  additional  computation  of  quantities  of  chemical 
interest, 

•  the  sparse  solver  MA  30  in  the  version  developed  by 
Duff /Nowak  [5]. 

Extrapolation  integrator  versus  BDF  integrator.  Nearly  all  chem¬ 
ical  kinetics  packages  use  a  stiff  Integrator  of  BDF-type  for 
the  numerical  simulation.  The  question  to  be  discussed  now  is 
whether  the  new  stiff  integrator  of  extrapolation  type  is  com¬ 
petitive.  A  number  of  numerical  comparisons  have  already  been 
documented  in  [1,4]  including  standard  test  problems  and 
chemical  kinetics  examples.  In  order  to  give  some  impression 
about  the  performance  of  the  present  version  of  LARKIN,  a  few 
further  chemical  kinetics  examples  are  presented  here  -  using 


either  the  standard  extrapolation  integrator  or  the  BDF- 
integrator  due  to  Gear  [8]  and  Hindmarsh  [10].  Of  course, 
both  integrators  have  been  endowed  with  common  scaling  and 
sparse  techniques.  All  comparison  runs  have  been  made  on  the 
IBM  370/168  of  the  University  of  Heidelberg  using  the  FORTRAN 
optimizing  compiler  with  OPT  *  3.  The  following  3  problems 
may  stand  for  a  larger  set  of  problems  that  have  been  solved 
meanwhile: 

HEXAN1 :  thermal  decomposition  of  n-hexane  (due  to  Isbarn, 

Ederer ,  Ebert  [12])  NCEQ  -  240,  NSP  «  59,  NZE  «  523 

HEXAN2:  extended  model  of  n-hexane  (due  to  Isbarn  [11]) 

NCEQ  «  705,  NSP  -  166,  NZE  »  1270 

RNA:  polymerization  model  of  RNA  (due  to  Schneider, Hein¬ 

richs,  Poll  [16]) 

NCEQ  -  770,  NSP  *  352,  NZE  *  2642 

The  comparative  results  for  the  extrapolation  code  METASC  and 
the  BDF-code  GEARS  for  single  precision  (tolerance  1 .E-3  with 
a  mantissa  of  about  6  decimal  digits)  and  double  precision 
(tolerance  1.D-3  and  1 .D-6  with  a  mantissa  of  about  16  decimal 
digits)  are  listed  in  Table  1 .  The  column  GEARS  with  maxord  » 3 
has  been  added,  because  in  this  case  the  A  (a)  - stability 
properties  of  the  two  underlying  discretizations  are  compara¬ 
ble  -  see  Table  2 .  From  Table  1 ,  the  following  observations 
are  worth  discussing: 
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Table  1 . 

Comparison  of 

simulation 

times  [sec]. 

METASC 

GEARS 
maxord  =  5 

GEARS 
maxord  *3 

HEXAN1 

1  .E-3 

3.98 

Fail** 

Fail** 

t 

1  .D-3 

3.46 

4.09 

3.66 

1.D-6 

7.50 

7.14 

8.83 

HEXAN2 

1  .E-3 

10.63 

12.05 

12.02 

1  .D-3 

12.30 

11.14 

11.10 

1  .D-6 

22.62 

21.15 

24.22 

RNA 

1  .E-3 

48.28 

38.70 

41  .B6 

1  .D-3 

55.68 

44.46 

48.20 

1  .D-6 

114.36 

106.59 

147.23 

•)  fail  run  also  with  maxord  *  2 


(I)  In  the  largest  example  (RNA) ,  the  code- GEARS  with 

maxord  -  5  is  about  20%  faster  than  METASC.  A  possible 
explanation  of  this  fact  might  be  that  the  dynamic  behav¬ 
ior  of  that  chemical  system  turns  out  to  be  rather  reg¬ 
ular  -  compare  Fig.  2.  This  property  favors  multistep 
methods  of  this  kind.  In  the  other  successful  runs,  the 
computing  times  are  nearly  the  same. 


(II) In  the  single  precision  version,  MET AS C  is  more  robust 
than  GEARS.  This  property  seems  to  be  typical  over  a 
wider  class  of  examples. 

Table  2.  Comparison  of  A(a)  -stability  properties  (9] 

(for  discretization  of  order  k) 

BOF  discretization: 

k  1  2  3  4  5  6 

a  90°  90°  86°  76°  50°  16° 

semi-implicit  mid-point  discretization : 

k  1  3  5  7  9  11 

a  90°  90°  90°  87°  86°  56° 

Ih  view  of  storage  economy  (compare  section  2)  and  the  fact 
that  low  accuracy  requirements  are  often  sufficient  for  chem¬ 
ical  applications,  the  single  precision  version  of  LARKIN 
seems  to  be  preferable.  For  low  accuracies,  the  BDF  code  usu¬ 
ally  supplies  enough  intermediate  output  points  to  permit 
the  use  of  a  simple  plot  routine,  whereas  the  extrapolation 
code  tends  to  supply  comparatively  few  output  points.  How¬ 
ever,  the  few  output  points  obtained  by  the  new  order  and 
stepsize  control  (due  to  Deuflhard  [3])  reflect  the  dynamic 
behavior  of  the  chemical  system  so  that  a  more  sophisticated 
plot  routine  (such  as  [15])  can  be  used  instead. 

The  above  comparative  remarks  seem  to  justify  the  decision 
to  use  the  stiff  integrator  of  extrapolation  type  in  the 
simulation  package  LARKIN. 


SPELltS 

2 RNA  problem:  graphical  output  for  349  (out  of  352)  species 
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Chemical  kinetics  -  an  overview  from  the  point  of  view  of 
Numerical  Analysis  and  Software  Implementation 

Lennart  Edsberg 

t.  Introduction 

Numerical  simulation  of  chemical  kinetics  is  one  important  application 
area  of  stiff  computation.  During  the  last  decade,  when  software  for  stiff 
problems  has  been  available,  even  bigger  systems  of  chemical  reactions  has 
been  simulated  on  computers.  Since  most  chemical  kinetics  problems  are 
stiff,  the  numerical  simulation  using  non-stiff  methods  such  as  Runge- 
Kutta  is  in  general  not  successful, so  before  the  existence  of  stiff  nume¬ 
rical  software  it  was  common  to  use  a  combination  of  analytical  as  well  as 
numerical  tools  in  order  to  be.  able  to  obtain  approximate  solutions  of  the 
set  of  ordinary  differential  equations  describing  the  kinetics  of  the  system. 
In  the  analysis  of  such  a  system  the  chemist  is  often  guided  by  some  a 
priori  knowledge  about  the  magnitudes  of  the  concentrations  of  the  com¬ 
ponents.  This  infonnation  has  been  utilized  especially  in  small  systems 
in  a  way  that  sometimes  makes  it  possible  to  partition  the  system  into 
stiff  and  non-stiff  variables.  A  good  example  of  such  an  analysis  can  be 
found  in  Lilt  and  Segel  [t],  pp  304-307,  where  a  biochemical  enzyme  cata¬ 
lysis  reaction  system 

E  +  S  **  C  E  ♦  P  ...(1) 

is  treated.  Setting  up  the  system  of  ordinary  differential  equations  (ODE's) 
for  this  system  and  scaling  the  variables  with  respect  to  their  magnitudes 
leads  to  the  following  set  of  ODE's  for  the  scaled  concentrations  of  S  and  C: 

s  »  -s  ♦  (s  ♦  k  -  X)c  s(0)  »  1 
ec  •  s  -  (s  ♦  k)c  c(0)  “  0  •••(2) 

where  k  and  X  are  parameters  being  functions  of  the  reaction  rates  of  the 
system  and  e  is  a  small  quantity.  -  After  this  analytical  preprocessing 
leading  to  a  singular  perturbation  formulation  of  the  problem  a  familiar 
approximation  method  known  as  the  steady-state  approximation  (SSA)  could 
be  used.  In  the  example  given  above  this  method  consists  of  letting  e»0, 
hence  giving  a  system  of  differential-algebraic  equations.  This  approxima¬ 
tion  of  the  dynamics  of  ...(1)  is  referred  to  as  the  Michael is-Menten 
kinetics. 

Although  the  SSA  has  turned  out  to  be  unreliable  for  general  problems 
the  scaling  procedure  leading  to  it  gives  an  important  piece  of  information 
and  can  be  utilized  numerically. 


In  this  paper  we  give  an  overview  of  chemical  kinetics  problems,  some 
important  mathematical  properties  and  how  the  numerical  treatment  is  per¬ 
formed  in  existing  simulation  programs  for  chemical  kinetics.  There  is 
also  shown  some  recent  results  on  a  scaling  procedure  proposed  by  Dahlquist 
leading  to  a  singular  perturbation  formulation  of  the  ODE's  and  how  this 
information  can  be  used  from  numerical  point  of  view. 


2.  Some  mathematical  properties  of  chemical  kinetics 

Consider  a  closed  system  where  several  chemical  reactions  are  taking  place 
simultaneously.  The  system  is  held  at  constant  temperature  and  the  volume 
does  not  change.  It  is  also  assumed  that  the  system  is  operating  under  homo 
geneous  conditions,  i.e.  no  spatial  gradients  occur.  -  Under  these  very 
idealized  conditions  we  have  a  reactor,  in  chemistry  usually  called  the 
isothermal  batch  reactor.  Although  operating  under  these  restricted  con¬ 
ditions  this  reactor  is  found  in  many  important  applications. 

Assume  that  n  chemical  species  A.,  iat,2,...,n  simultaneously  interact  in 


m  reactions 


l  Pij^ 

i-l  1 


k. 

-1 


i-1  1J  1 


,  3-1.2 . m  ...(3) 


where  p^j  and  q.j  are  integers  (stoichiometric  coefficients)  and  k.  are 
parameters  called  the  rate-constants  of  the  reactions.  Under  the  physical 
conditions  mentioned  above  and  the  assumption  that  the  mass-action  lav 
is  applied  for  the  rate-functions,  the  following  mathematical  model  consis¬ 
ting  of  a  set  of  ordinary  differential  equations  can  be  set  up: 

n 


•  Vq ij  "  Pij>kjvnA  kj 

j-1  J  J  Jk-1 


c.(0)«c.  ,  ...(4) 
i  io 


where  c.  are  the  concentrations  of  A.  and  c.  are  the  concentrations  of  A. 

i  i  io  i 


at  time  t  ■  0.  The  system  ...(4)  can  be  written  in  compact  form: 

c  ■  (Q  -  P)Ki(c)  ,  c(0)  given 


...(5) 


where  Q  and  P  are  n*m  matrices,  K  a  diagonal  matrix  ■  diag(k^)  and  g  is 
a  vector  with  m  components  containing  the  polynomial  expressions  in  ... (4) 
The  vector  Kg  is  denoted  by  r  and  is  a  vector  of  m  components  containing 
the  rate-functions  (formed  by  the  mass  action  law).  The  matrix  S  ■  Q  -  P 
is  referred  to  as  the  stoichiometric  matrix. 

Thus  a  chemical  kinetics  problem  is  specified  by  the  two  matrices  S  and  P 
(usually  being  very  sparse  with  integers  as  entries),  the  diagonal  matrix 
K  (with  only  positive  entries)  and  the  initial  concentration  vector  c(0) 
(with  nonnegative  entries). 


Example  (chemical  pyrolysis,  originating  from  Datta  1967,  the  set  of  ODE's 
being  one  of  the  testproblems  in  the  stiff  integrator  comparison 
by  Enright,  Hull  and  Lindberg  [2]. 


A1  " 


A2  +  A3 


^  ♦  a3  ■* 


A1  +  A3 


-  \ 

-  a3  ♦  Afi 


(6) 


The  corresponding  set  of  ODE's  are  according  to  ...(4) 


i2  -  k,c,  -  k2c2c3 

*3  '  ktc1  '  k2c2c3  '  k3e1c3  *  k4c 

d4  ’  k3c1c3  ‘  k4c4 

h  ‘  k2c2c3 
■  k.c 


(7) 


Only  the  first  four  equations  were  used  in  the  testproblem. in  [2] 


* 

* 

* 

\ 

-1 

0 

-1 

0 

1 

0 

1 

0 

1 

-1 

0 

0 

0 

1 

0 

0 

1 

-1 

-1 

1 

p  • 

0 

1 

1 

0 

0 

0 

1 

-1 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

a 

0 

0 

1 

J 

0 

0 

0 

For  this  system  n  ■  6  and  m  ■  4.  The  rate  constants  used  in  the 
test  problem  were  kj  ■  7.89*10  ^  ,  kj  ■  1.13*1 0^ ,  k^  ■  1.1*1 0^ 
k^  ■  1.13- 10^  and  the  initial  values  Cj(0)  ■  1.76*10  \  the 
others  were  set  to  zero. 

Note  that  +  c^  and  2c ^  +  c3  +  c^  +  3c^  +  2c^  +  2c^  ■  0 

i.e.  there  are  linear  relations  between  the  variables  due  to  rank 
deficiency  in  the  matrix  S.  In  fact,  for  this  example,  rank(S)  *  3. 
The  wide  spread  of  size  in  the  rate  constants  is  the  reason  of 
stiffness. 


In  real  world  problems  we  cannot  of  course  neglect  the  contributions  from 
transport-  and  energyprocesses  such  as  diffusion,  convection,  changes  in  the 
temperature  etc.  This  seems  to  imply  that  problems  that  can  be  formulated  as 
...(5)  are  academic  and  has  little  to  do  with  real  problems.  However,  the 
model  is  used  in  practise  and  can  be  found  in  application  areas  such  as 
atmospheric  chemistry,  physical  chemistry,  biochemistry,  metallurgy  etc. 

The  size  of  such  problems  can  vary  from  a  few  reactions  and  species  in  e.g. 
physical  chemistry  to  several  hundred  in  biochemistry.  The  systems  can  also 
exhibit  all  kinds  of  numerical  problems  such  as  different  degrees  of  stiffness 
and  instability  phenomena  in  e.g.  oscillating  systems. 


The  model  ...(5),  which  ve  have  discussed  so  far,  is  thus  applicable  to  the 
isothermal  batch  reactor.  Examples  of  reactors  where  transport  and  energy- 
processes  must  be  taken  into  account  is  the  continuous-flow  stirred  tank  reac¬ 
tor  (CSTR)  and  the  plug-flow  tubular  reactor  (PFTR) ,  which  however  can  be 
modelled  by  sets  of  ODE's  obtained  by  making  small  modifications  of  ...(3), 
see  also  Edsberg  [3]  for  further  discussion. For  the  CSTR  the  equations  are 

c  *  g(cf  -  c)  +  (Q  -  P)Kg(c)  ...(8) 

where  6  is  a  constant  and  c^  is  a  constant  vector. 

Another  type  of  generalisation  occurs  in  systems  where  spatial  gradient 
cannot  be  neglected.  This  type  of  model  is  often  used  in  atmospheric  chemist¬ 
ry  and  consists  of  coupled  partial  differential  equations  of  parabolic  type: 

c^  -  -div(cjv)  -  div(D^gradc^)  +  ...(9) 

where  r.  are  the  components  of  a  vector  of  molar  rates,  which  for  mass  action 
kinetics,  is  the  right  hand  side  of  ...(5). 

For  systems  of  ODE's  coming  from  chemical  kinetics  it  is  obvious  that  the 
variation  of  the  varables  c^  is  restricted  to  certain  bounds.  Returning  to 
the  model  ...(5)  which  is  valid  under  closed  conditions  the  total  mass  must 
be  preserved.  This  can  be  expressed  by 

-T-  -T- 

me  *  m  c(0)  *  a  ... (10) 


where  m  is  a  vector  of  molecular  weights  of  the  species  A^.  Thus 

m^c  *0  ...(11) 

which  means  that  c  belongs  to  a  subspace  L  being  orthogonal  to  the  vector  m, 
which  is  directed  into  the  positive  orthant  R*.  The  concentration  vector  c 
then  belongs  to  a  hyperplane  Lq  being  parallell  with  L  and  intersecting  the 
coordinate  axis  at  intercepts  y^>  0. 

Example  A  system  of  three  reversible 

reactions  involving  three  species 

A1  ^VA2 

V/ 

A3 


The  area  0  *  R*  fl  L  is  called  the  reaction  simplex  and  is  the  domain  in  R* 

u  n  a  n 

in  which  a  solution  of  ...(5)  resides.  This  of  course  demands  that  c^(t)  i  0, 
i  *  1,2,...,n,  which  however  is  true  for  mass  action  kinetics  if  c^(0)  2  0. 
Hence  from  ...(10)  we  see  that 

0  &  c^(t)  &  a/m ^ 

which  is  an  a  priori  bound  for  every  solution  of  ... (5) 


...(12) 


Moreover,  since  Che  ODE's  describing  the  kinetics  can  be  set  up  from  ?che 
reaction  network  . . . (3)  there  is  a  lot  of  information  of  the  reaction  struc¬ 
ture  contained  in  the  right  hand  sides  of  ...  (5) .  This  information  has  been 
used  by  Feinberg,  Horn  and  Jackson  in  an  extensive  qualitative  analysis  of 
the  solution  manifolds  of  ... (5) .  An  overview  of  their  work  can  be  found  in 
Feinberg  [4].  The  results  of  their  theory  consist  of  statements  of  the  number 
of  steady  states  and  their  stability  properties.  Their  theory  is  restricted 
to  specified'  classes  of  chemical  reaction  networks,  namely  those  having 
positive  steady  states  belonging  to  the  interior  of  This  excludes  net¬ 
works  containing  such  irreversible  steps  that  lead  to  a  steady  state  on  38^. 

In  other  cases,  however,  the  theory  assures  a  single  positive  steady  state 
which  is  asymptotically  stable  for  networks  which  among  other  conditions  have 
a  generalised  reveribility  character  called  weak  reversibility.  It  would  lead 
too  far  to  give  an  overview  of  their  results  in  this  paper,  but  the  statements 
about  the  steady  state  that  can  be  said  a  priori  is  certainly  of  importance 
when  you  want  to  know  what  can  be  expected  of  the  solution  trajectories  from 
a  certain  kinetics  problem.  To  illustrate  this,  consider  the  following  net¬ 
works,  given  by  Feinberg  [4]: 


...(13) 


A  +  A 
A1  A2 


A3<,—~  2 ^2 


A|  +  A2  ^3  v-  A2 

Of  these  very  similar  three  reaction  schemes  only  the  second  has  a  single 
stable  positive  steady  state  for  all  positive  values  of  the  six  rate  constants 
in  each  reaction  simplex.  The  first  network  on  the  other  hand  may  have  mul¬ 
tiple  positive  steady  states  for  certain  values  of  the  six  rate  constants 
while  the  third  network  cannot  have  multiple  steady  states, but  for  certain 
values  of  the  eight  rate  constants  an  unstable  steady  state  and  a  stable 
limit  cycle. 


-t>- 


3.  The  development  of  software  for  chemical  kinetics 

All  since  the  beginning  of  use  of  digital  computers  for  numerical  simulation 
of  dynamical  systems,  the  problem  class  of  chemical  kinetics  has  attracted 
much  interest  both  among  people  developping  numerical  software  and  among  the 
presumptive  users,  i.e.  the  chemists.  Several  program  packages  have  been 
implemented  during  the  last  two  decades.  Apart  from  the  practical  importance 
and  interest  of  the  problem  itself,  one  reason  for  this  development  is  pro¬ 
bably  the  appealing  structure  of  the  problem  of  chemical  kinetics  which 
among  other  things  makes  it  possible  to  program  a  translator  which  transforms 
the  chemical  network  ...(3)  into  the  set  of  ODE's  ...(5).  Another  reason 
is  perhaps  that  numerical  analysists  in  this  problem  class  have  found  an 
appropriate  test-batch  for  stiff  problems  making  it  possible  to  try  an  ODE- 
method  on  problems  of  varying  size,  stiffness  and  even  instability  phenomena. 

Since  the  first  implementations  of  implicit  methods,  such  as  Gear's  method, 
the  list  of  software  for  chemical  kinetics  has  increased  and  here  we  give  just 
a  choice  of  software  that  has  been  published:  BELLCHEM  by  Edelson  [5],HAVCHM 
by  Stabler  and  Chesick  [6],  KINRATE  by  Edsberg  and  Uhlgn  [7],  FACSIMILE  by 
Curtis  [8],  MACKSIM  by  Carver  and  Boyd  [9],  KISS  by  Gottwald  [10]  and  LARKIN 
by  Deuflhard, Bader  and  Nowak  [ill. 

The  common  features  about  chemical  kinetics  programs  are  that  the  user  can 
work  interactively  with  the  computer  using  an  alpha-numerical  or  graphical 
terminal  and/or  some  plotter.  The  purpose  for  writing  such  programs  is  also 
that  the  user  need  not  be  acquainted  with  the  numerical  methods,  but  should 
be  able  to  concentrate  upon  the  chemistry  and  have  visualized  how  the  kinetics 
of  a  certain  reaction  system  behaves  for  different  values  of  initial  con¬ 
centrations  c(0),  rate  constants  K  or  the  reaction  structure  itself  i.e.  Q,P. 

Up  to  now  we  have  discussed  only  the  simulation  problem,  i.e.  the  behaviour 
of  the  concentrations  as  function  of  time.  One  should,  however,  be  aware  of 
the  fact  that  this  is  just  one  important  computational  aspect  of  chemical 
kinetics.  A  rough  partitioning  of  this  and  other  numerical  problems  is: 

1.  Computation  of  the  steady  state,  i.e.  determining 
c(»)  in  ...(5) 

2.  Simulation  of  the  dynamical  behaviour,  i.e.  solution 
of  the  set  of  ODE's  ...(5) 

3.  Determination  of  unknc  rate  constants  k^  in  ...(3) 
from  experimental  measurements  of  the  system. 

4.  Identification  of  the  mechanism,  i.e.  determining  the 
matrices  Q  and  P  in  ...(5)  from  experiments. 

The  problems  are  probably  given  in  increasing  order  of  difficulty,  but  they 
cannot  be  separated  from  each  other,  e.g.  for  some  problems  c(»)  cannot  be 
solved  without  solving  the  ODE's  and  the  detmination  of  rate  constants  must 


be  based  on  an  efficient  algorithm  for  simulation. 

Returning  to  the  simulation  problem:  when  implemented  on  a  computer  it  is 
natural  to  split  a  chemical  kinetics  program  into  a  number  of  segments,  i.e. 
the  program  package  consists  of  several  parts  which  communicate  through 
data-files.  This  is  the  case  for  e.g.  the  programs  KINRATE,  FACSIMILE  and 
LARKIN,  where  you  can  distinguish  between  the  following  parts: 

1.  TRANSLATOR,  which  transforms  the  reactions  given  as  input  by  the  user 
into  either  a  code  for  the  right  hand  sides  f(c)  in  ..(5)  and  the 
correspondning  Jacobian  3f/3c  or  some  sparse  representation  of  the 
matrices  Q  and  P  (or  S  and  P) ,  from  which  it  is  possible  in  a  later 
phase  of  the  program  to  compute  f(c)  and  the  Jacobian  when  they  are 
needed. 

2.  PARAMETER-AS SIGNER,  which  obtains  as  input  the  rate  constants  k^  and  the 
initial  concentrations  c^CO).  Here  the  user  may  also  specify  certain 
numerical  parameters,  such  as  the  starting  and  final  point  of  integration, 
the  tolerance  for  the  ODE-solver,  scaling  factors  for  the  components, 
possibly  also  initial  stepsize  and  the  largest  allowed  stepsize  in  the 
integration  interval. 

3.  SIMULATOR,  which  performs  the  numerical  integration  of  ...(5).  The  user 
may  here  prescribe  the  times  t^  where  the  solution  c(t^)  should  be  prin¬ 
ted  out.  In  this  part  it  may  also  be  possible  for  the  user  to  decide 
how  to  act  if  the  simulation  for  some  reason  goes  wrong. 

4.  POSTPROCESSOR,  with  the  help  of  which  the  user  may  look  at  the  terminal 
or  plot  on  a  plotter  the  results  from  the  simulation  parts  in  certain 
interesting  intervals  or  examine  more  closely  just  a  few  of  the  com¬ 
ponents  c^  of  the  reaction  system. 

Most  programs  for  chemical  kinetics  are  implemented  in  FORTRAN,  but  the 
program  package  KISS  by  Gottwald  is.  written  in  SIMULA  the  better  structuring- 
possibilities  of  which  has  made  it  possible  to  write  the  whole  program  in 
one  module. 

We  have  already  mentioned  the  possibility  of  translating  a  chemical  network 
into  the  right  hand  sides  f(c)  of  ...(5)  with  the  help  of  a  program.  Since 
the  components  of  f(c)  are  built  up  by  polynomial  expressions,  the  Jacobian 
can  be  formed  analytically  also  with  the  help  of  a  program.  The  formal  ex¬ 
pression  for  the  Jacobian  is 

3f/3c  -  SKdiag(gi)PTdiag(1/ci)  ...(14) 

if  c^  *  0.  The  Jacobian  is  then  used  in  the  Newton-iterations  when  solving 
numerically  the  set  of  nonlinear  equations 

c  -  h6f(c)  +3-0  ...05) 

in  each  time-step.  Here  c  is  the  approximate  solution  at  time  t+h,  t  the 
former  time  at  which  the  solution  was  computed,  8  is  a  method-dependant 


constant  and  d  contains  information  from  the  solution  at  time  t.  Hence  a 
set  of  linear  equations 

Ax  -  (I  -  hB3f/3c)x  -  b  ...(16) 

has  to  be  solved  for  each  Newton-iteration.  This  is  done  by  using  the  LU- 
decomposition  of  A,  L  and  U  being  retained  as  long  as  convergence  properties 
are  satisfactory.  Each  time  hS  is  changed  or  some  elements  of  the  Jacobian 
have  changed  so  much  that  convergence  is  too  slow,  the  LU-decomposition  is 
updated  by  first  recomputing  the  Jacobian  then  computing  the  new  LU-decom¬ 
position  of  A.  Comments  on  the  calculation  of  the  Jacobian  numerically  or 
analytically  is  found  in  Curtis  [12]. 

The  set  of  linear  equations  ...(16)  should  be  solved  by  some  sparse  tech¬ 
nique  at  least  if  the  number  of  chemical  species  is  large.  Sparse  techniques 
are  used  in  e.g.  FACSIMILE  and  LARKIN,  while  the  full  matrices  are  treated 
in  e.g.  KINRATE  and  KISS.  This  sparse  technique  is  a  general  purpose  method 
that  can  be  used  for  any  sparse  system,  thus  no  extra  information  based  on 
the  special  structure  of  chemical  kinetics  problems  is  used.  An  up  to  date 
overview  of  the  solution  of  the  nonlinear  equations  ...(15)  is  found  in 
Soderlind  [13]. 

So  far,  no  attempt  has  been  made  to  compare  different  programs  for  chemical 
kinetics  with  each  other.  However,  comparisons  of  stiff  ODE-solvers  applied 
to  chemical  problems  have  been  performed  and  results  can  be  found  in  e.g. 
Enright  and  Hull  [14]  and  in  Gottwald  [15]. 


4.  The  use  of  scaling  of  the  variables  in  the  set  of  ordinary  differential 
equations  of  chemical  kinetics. 

The  possibility  of  giving  scaling  factors  for  the  variables  c^  in  ...(5) 
to  be  used  in  the  local  error  estimate  of  the  stiff  ODE-solver  has  already 
been  pointed  out  in  section  3.  Different  ways  of  performing  automatic  scaling 
of  the  local  error  estimate  is  discussed  by  Curtis  in  [12]  and  by  Deuflhard 
in  [11  ].  In  some  numerical  experiments  reported  by  Edsberg  [3]  it  was  found 
that  for  certain  chemical  systems  some  scaling  device  seemed  to  be  necessary 
in  order  to  obtain  reliable  approximations  of  the  solution  trajectories  at 
least  when  the  machine  accuracy  and/or  the  tolerance  used  in  the  numerical 
integrator  was  not  small  enough.  In  [3]  the  scaling  factors  were  chosen  from 
a  priori  knowledge  of  the  size  of  the  variables  c^  and  scaling  was  tested 
both  by  transforming  the  set  of  ODE's  ...(5)  to  a  set  of  ODE's  for  the  scaled 
variables  and  by  using  different  weights  in  the  norm  of  the  local  error 
estimate  used  to  control  the  stepsize  strategy.  In  both  cases  the  results 
were  satisfactory  when  tried  on  some  stiff  ODE-solvers  if  scaling  was  used , 
but  unreliable  if  no  scaling  was  used  (corresponding  to  using  an  absolute 
error  tolerance).  Similar  results  are  reported  by  Curtis [12]. 

A  more  systematic  approach  to  the  scaling  problem  is  given  by  Dahlquist, 


Edsberg,  SkSllermo  and  Soderlind  [16],  who  present  a  semi-automatic  algo¬ 
rithm  for  scaling  the  variables.  The  algorithm  is  tested  on  some  chemical 
kinetics  problems  and  the  scaling  procedure  emerges  in  a  singular  perturbation 
formulation  of  the  set  of  ODE's  ...(5) 


where  x  ■  - 

x„ 


x,  -  S^CXj^.e) 
* 

ex2  -  S2f2(x1,x2,e) 


Xj  (0) 

x2(0) 


..(17) 


and  x  is  a  vector  of  the  variables  c.  scaled  by 

l 


certain  scaling  factors,  x  and  S  are  then  partitioned  according  to  the  non¬ 
stiff  variables  x^  and  the  stiff  variables  x2»  c  is  a  small  parameter.  The 
system  ...(17)  is  then  treated  with  a  stiff  ODE-solver. 

The  scaling  procedure  is  simply  based  on  a  diagonal  scaling  of  the  dependant 
variables  c.  and  the  time  t: 

l 


t  -  e0r 


...(18) 


E  is  a  diagonal  matrix  with  e^  in  the  diagonal,  i*1 ,2, . . . ,n.  With  this  trans¬ 
formation  inserted  into  ...(5)  we  obtain 


Ex'  -  SKg(x)  ,  x(0)  -  E_1c(0)  ...(19) 

where  k  «  eQKG(e)  ,  G(e)  -  diag(g.(e))  ...(20) 

where  e  is  a  vector  of  the  e^,  i*1,2,..,n,  and  '  denotes  differentiation  with 
respect  to  t.  Thus  only  the  rate  constants  k^  and  the  initial  values  c^(0) 
are  affected  by  this  transformation;  neither  S  nor  g  is  changed  (provided 
E  is  kept  on  the  left  hand  side  in  front  of  the  derivatives).  In  this 
transformed  system  ...(19)  we  now  try  to  use  the  scaling  factors  to  identify 
which  variables  are  stiff  and  which  are  non-stiff. 

The  way  of  finding  the  scaling  factors  e^,  i«*0,1,...,n  is  outlined  by 
Dahlquist  and  Mao-Zu-Fan  [17]  and  is  based  on  the  following  assumptions: 

1 .  In  a  time-interval  [Tj,t2]  where  the  scaling  is  performed  the  scaled 
variables  x^  and  their  derivatives  x^'  are  of  the  same  order  of  magni¬ 
tude  0(1). 


2.  The  vector  x  is  partitioned  into  two  vectors  x^  and  x^-  For  the  vector 
Xj  (the  non-stiff  variables)  the  derivatives  x^'  balance  at  least  one 
term  on  the  right  hand  side.  For  the  vector  x2  (the  stiff  variables) 
there  is  balance  between  terms  of  opposite  sign  in  the  left  hand  side. 
(Balance  here  means  that  two  terms  are  of  the  same  order  of  magnitude) 


If  the  scaling  procedure  succeeds  in  balancing  terms  the  system  will  be 
transformed  into  singular  perturbation  form.  In  [17]  there  is  outlined  an 
algorithm  which  gives  the  scaling  factors  e^  provided  the  "user"  gives 
suggestions  of  which  terms  balance  each  other  in  all  equations.  The  problem 
can  then  be  formulated  as  a  linear  program  which  tries  to  find  the  e. 


that  match  the  balance  conditions  as  well  as  possible. Work  is  now  going 
on  at  our  department  which  aims  at  designing  an  automatic  scaling-procedure. 
We  illustrate  the  scaling  procedure  here  on  the  problem  ...(7)  and  the 
advantages  we  get  of  finding  the  transformed  system  ...(17) 

With  rate-constants  and  initial  values  inserted  and  after  using  one  of 
the  linear  relations  between  the  c^'s  the  system  can  be  written: 


-7.89*1<f  10Cl  -  1.1* 107c1c« 

-10  1  q  1  J 

7.89*10  c,  -  1.13* 10  CjCj 

1.1«107ctc3  -  1  - 13 • 103c4 


,  c1 (0)  -  1.76*10 
,  c2(0)  -  0 


,  c,  (0)  -  0 


..(21) 


c2  "  C3  +  c4 


Let  the  scaling  factors  e^  be  expressed  in  powers  of  e,  being  a  small  posi¬ 
tive  given  number  o.  ,-.v 

*  £  1  • • • \22 j 

The  problem  is  now  to  make  a  hypthesis  in  each  equation  about  the  leading 
terms  which  balance  each  other.  In  the  first  equation,  since  both  terms  on 
the  right  hand  side  have  the  same  sign  one  of  them  (or  both)  balance  the 
derivative,  let's  assume  the  second  term  does.  Assume  also  that  c^  and  c^ 
are  stiff  variables  hence  the  terms  on  the  right  hand  sides  balance  each 
other  in  the  last  two  ODE's.  In  the  algebraic  equation  assume  that  all  terms 
are  of  the  same  magnitue,  i.e.  both  terms  on  the  right  hand  side  balnce  the 
left  hand  side.  Finally,  guided  by  the  magnitude  of  the  initial  value  let 
e  -  W'3. 

The  balance  assumptions  now  give  the  following  relations: 

_  ,  a.+a„ 


cV°o  -  V  \  3 

klEa1  -  k2ea2  a3 

.  a.+o,  ,  o. 

k.e  1  3  «  k.e  4 

3  4 


...(23) 


“2  3  4 

With  the  known  numerical  values  inserted  into  ...(23)  and  rounding  every  rate 

konstant  to  the  closest  power  of  ten  we  get  the  following  set  of  linear 

equations:  3aQ  -  7  -  3^ 

-10  -3a,  -  9  -  3a.  -  3a. 

1  ^  ^  (21 
7  -  3a,  -  3a3  -  3  -  3»4  " 

a2  "  °3  “  °4 

The  closest  integer  solution  of  ...(24)  is  Oq  ■  -1 ,  a,  *  1 ,  a2  *  a^  *  a^  *  4 
which  gives  the  following  set  of  scaled  ODE's 


xj  ■  0.789e  x,  -  0.011x,x2 

ex2  ■  0.787x,  -  0*01 1x2x^ 

e2x.'  »  llx.x.  -  1.13x. 

4  13  4 


x2  "  X3  +  x4 


where  t  »  10  t,  c,  ■  10  x, ,  c^ 


10“12x.,  i-2,3,4 


x, (0)  -  1.76 
x2(0)  -  0 
x4(0)  -  0 


...(25) 


From  the  equations  ...(25)  we  now  have  good  information  of  the  qualitative 
behaviour  of  the  solution  and  we  have  also  obtained  a  natural  scaling  of 
each  variable  where  1  is  a  proper  measure  of  each  variable  in  a  time  interval 
where  the  variables  X2  and  x^  have  reached  a  nearly  steady-state.  We  see 
that  the  transient  phase  consists  of  one  very  quick  phase  determined  by 
and  one  less  fast  interval  determined  by  e.  Hence  we  get  a  priori  infor¬ 
mation  of  what  can  be  expected  of  the  stepsize-regulation  of  a  stiff  ODE- 
solver. 

So  after  this  "analytical  preprocessing"  of  the  system  the  problem  is  now 
well  suited  for  being  treated  by  a  stiff  method,  and  with  very  moderate 
value  given  to  the  tolerance  parameter  we  obtain  the  solution  with  good 
relative  accuracy. 

Thus,  if  it  is  possible  to  find  a  scaling  that  transforms  the  set  of  ODE's 
...(5)  into  the  form  ...(17)  we  have  gained  the  following  advantages: 

1 .  We  have  better  insight  into  the  dynamics  of  the  problem 

2.  The  problem  is  better  suited  for  numerical  treatment  -with  a  stiff 
numerical  integrator;  we  have  variables  that  are  scaled  so  that  1  is 
appropriate  measure  of  the  size  of  all  ariables. 

3.  The  partitioned  form  ...(17)  makes  it  possible  to  use  numerical  methods 
that  utilize  this  form,  see  Sftderlind  [13]. 

5.  Conclusions 

In  this  paper  we  have  given  a  brief  survey  of  some  mathematical  and 
numerical  properties  in  chemical  kinetics.  We  have  also  given  some 
examples  of  existing  software  for  interactive  simulation  on  a  computer 
by  using  stiff  numerical  integrators.  It  is  further  indicated  that  the 
special  structure  of  the  set  of  ordinary  differential  equations  of 
chemical  kinetics  makes  it  possible  to  transform  the  equations  into  a 
singular  perturbation  formulation  by  using  a  scaling  procedure.  The 
transformed  equations  are  then  used  in  the  numerical  integration,  giving 
more  reliable  results.  At  our  department  this  scaling  procedure  is  in¬ 
vestigated  with  the  aim  to  develop  an  algorithmic  description  which  uti¬ 
lizes  all  information  that  can  be  deduced  about  the  problem  before  perfor¬ 
ming  the  numerical  integration.  In  this  paper  we  also  point  out  that 
other  numerical  problems  are  of  great  importance  for  chemical  kinetics, 
i.e.  the  identification  problem 
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AN  OVERVIEW  OF  STIFFNESS  PROBLEMS  IN  NUCLEAR  REACTOR  KINETICS 


J.  DEVOOGHT  (St) 


1.  INTRODUCTION 


The  Integration  of  the  reactor  kinetics  equations  is  a  tine-  consuming  opera¬ 
tion  whenever  it  is  performed  in  the  full  spaoe-energy-time  frame.  Moreover 
when  coupling  of  the  neutronic  parameters  with  thermohydraulic  parameters  is 
present,  the  problem  is  even  mare  taxing  for  the  oonputer.  A  large  nunber  of 
approximate  integration  schemes  have  therefore  been  studied  to  reduce  die  nu¬ 
merical  problem  to  a  manageable  size.  However  neutron  kinetics,  despite  its 
large  spread  of  eigenvalues  is  not  typically  a  difficult  stiff  problem.  Mo¬ 
reover  special  techniques  have  been  devised  to  oope  with  the  particular  struc¬ 
ture  of  the  kinetics  equations,  owing  to  two  facts  : 

(1)  Neutron  kinetics  is  a  linear  evolution  problem,  which  becomes  non  linear 
in  so  far  as  it  becomes  a  neutron  dynamics  problem  through  coupling  with 
other  variables. 

(2)  Stiffness  can  more  or  less  be  "factored  out",  i.e.  if  we  write  the  neutron 
density 


N(r,v,t)  -  T(t)i|i(r,v,t)  (1) 

a  factorisation  used  in  "quasistatic  approximation",  a  suitable  function 
can  be  obtained  such  that  the  stiff  behaviour  is  more  or  less  confined 
to  T(t) . 


(X)  university  Libre  de  Bruxelles,  Brussels,  Belgium. 


One  way  to  cope  with  stiffness  problems  is  to  use  the  singular  perturbation 
method,  which  has  been  known  in  reactor  kinetics  as  the  prompt  juqp  approxima¬ 
tion.  Goldstein  and  Shotkin  ^  tried  to  extend  idle  prompt  jump  approxi¬ 
mation  to  hi^ier  orders  but  their  results  based  upon  intuitive  derivation  are 
valid  only  for  the  first  order.  The  systematic  application  of  the  singular 
perturbation  method  to  reactor  kinetics  was  first  proposed  by  Hendry  and 
Bell  ^  who  derived  the  algorithm  including  the  intermediate  expansion  by  a 
purely  heurestic  argument. 

Hscent  developments  have  been  given  by  Mika,  Blenski  and  Gadcmski  to  7> . 

The  method  seems  attractive  for  point  kinetics  and  some  non  linear  kinetics 
problems.  So  far  limited  numerical  experience  is  available  and  we  shall  not 
develop  this  point  here. 

Various  approximations  have  been  devised  :  at  cne  end  of  the  spectrun  we 
"point  kinetics"  (a  system  of  seven  linear  differential  equations)  where 
reactor  is  acting  as  a  whole,  i.e.  a  single  point  .,  at  the  other  end  of 
spectrun  we  have  a  full-fledged  3D  space  and  energy  problem  which  could 
ve  10^  unknowns.  Intermediate  options  involve  coarse  mesh  methods  or  fl 
synthesis  methods  which  we  group  under  the  heading  of  "multipoint  reactor  kine¬ 
tics". 

Advanced  work  in  numerical  reactor  dynamics  is  mostly  related  to  safety.  The 
need  for  fast  and  reliable  computing  schemes  for  neutron  transients  stems  frcm 
the  fact  that  neutron  fluxes  yield  the  thermal  power  that  drives  a  set  of 
thermchydraulic  equations.  By  far,  the  latter  are  the  more  difficult  and  more 
time  consisting,  and  neutron  kinetics  is  not  anymore  the  main  problem.  However 
two  points  remain  pressing  :  hew  to  lengthen  the  time  step;  hew  to  bound  the 
error.  Generally  speaking,  development  has  been  outside  the  main  stream  of 
OCE  stiff  solvers,  maybe  for  lade  of  familiarity.  Some  progress  has  been  made 
for  the  first  point  but  adequate  schemes  are  sorely  lacking  for  the  second 
point.  Benchmark  problems  allow  a  common  point  of  comparison,  but  so 
far  the  lade  of  analytical  solution  for  complicated  problems  does  not  allow  to 
distinguish  the  convergence  from  a  possible  bias  common  to  various  methods. 


J  *  S  1  t 


It  Is  scce time  implied  that  the  need  for  very  accurate  solution  of  reactor 
kinetics  problems  is  not  obvious  because  of  the  very  large  uncertainties  in 
the  thermohydraulics.  Howerer  time  step  lengthening  is  of  utmost  inportance, 
and  is  limited  by  the  fact  that  kinetic  operators  are  not  constant. 

The  lack  of  memory  space  for  large  problems  makes  multistep  methods  rather 
unsuitable,  and  therefore  one  step  methods  sire  almost  the  rule. 

As  it  may  be  surmised  almost  all  methods  make  implicit  or  explicit  use  of 
collocation,  Galerkin  or  interpolator/  quadrature,  and  therefore  of  an  assumed 
time  dependent  behaviour  of  the  solution,  usually  linear  combinations  of 
exponentials  and  polynomials.  Polynomial  however  are  not  very  suitable  for 
fast  transients  of  exponential  type  and  some  methods  place  increasing  reliance 
on  spectral  matching,  i.e.  the  cne-step  growth  function  W(hA)  which  approxima¬ 
tes  e  is  chosen  with  the  property  that 

hX, 

*»  e  1 

where  the  X,  's  are  seme  eigenvalues  (or  approximations  thereof)  of  A. 


2.  THE  NUCLEAR  REACTOR  KINETICS  PRCBIZM 

2.1.  Multigroip-multipoint  kinetics  equations 

Boltzmann  equation  reads 

-► 

||  =  A(t)  ?(t)  +  S(t) 

where  ?(t)  =[  ,  Fx(r,v,t)...  F^r^t)]1 

with  the  following  notation  : 

(a)  r,  v  =■  vn  ,  |n|  *  1,  t  resp.  position,  velocity  vector,  and  tine 

(b)  0(r,v,t)  :  neutron  flux  density 

(c)  Fi(r,v,t)  ^  x±(v)  C±(r,t) 

where  C^(r,t)  is  the  precursor's  density,  i.e.  the  isotope  that  yields 
delayed  neutrons 

00 

X,(v)  the  fission  spectrum  normalized  by  /  x^Mdv  *  1 
x  Q  x 

v  the  number  of  fission  r.  outruns  by  fission 

(d)  §  ^  [Q(r,v,t) ,  0...0]T 

with  Q  the  prcrrpt  neutron  independent  source,  usually  zero. 


Matrix  A  is  defined  by 


with  J,  the  fission  operators,  and  K  the  destruction  operator,  respectively 
defined  by 

09 

Ji  *  TiT  xi^  t  &r*  t  Zf(r,v')...  dv'dfl,  i«0,l...I  (5) 

o  4ir 


K  ■  nv  +  E.(r,v)  -  /  dv'  /  I  (r,v'-*-v) .. .dfi'  (6) 

Z  O  4ir  3 

where  l^(r,v)  ,it(r,v)  ,Eg(r,v'-*v)  are  respectively  fission  cross-sections, 
total  cross-sections  and  differential  scattering  cross-sections. 

The  fraction  of  neutrons  emitted  instantaneously  is  1-6,  and  the  fraction  of 
delayed  neutrons  is  6  emitted  from  I  (usually  1=6)  precursors.  A  weighted 
fission  operator  is  defined  by 

.  I 

J  *  (1-0) J  +  I  &.J.  (7) 

°  1 

where  subscript  o  is  reserved  for  prompt  neutrons. 

The  reader  may  find  a  good  survey  of  fundamentals  of  reactor  kinetics  as  well 

(8—12) 

as  some  numerical  aspects  in  references  .  This  overview  of  stiffness 
problems  is  by  no  means  a  review  of  numerical  reactor  kinetics  and  references 
have  been  therefore  limited  to  the  few  topics  examined  below. 


Usually  kinetics  problems  axe  solved  in  the  diffusion  approximation  frame  ^  '10'; 
Full  transport  kinetics  problems  present  problems  of  their  own  which  are  not 
examined  here^1^ .  The  energy  variable  is  discretized  into  G  "groups"  and  the 
space  variable  into  K  "points".  Operators  J  and  K  become  KG  x  KG  matrices 
and  Ai  becomes  A^  times  a  unit  matrix.  Ihe  system  of  multi  group-multipoint  neu¬ 
tron  kinetics  has  the  same  structure  as  (2)  (4)  where  t  is  a  vector  with  KG(I+1) 
components. 


A  Galerkin  type  approximation  yields 


[A]"1  Ip-fll 


d 

at 


^(t) 


A±[  1]  .... 


[a]”1  lej 


-Ajll] 


$(t) 


2±(t) 


(8) 


where  [a]  ,  [p  ] ,  [  g]  are  KG  x  KG  matrices  defined  by 


l°lkt  4  <V  <VK)V 


lsi’k£  ■  \  ViV 


-1 


u,kt 4  <v v''  V 


(9) 

(10) 

(11) 


*  “  KG  KG 

with  < ,  >  a  scalar  product  in  r  fi  v  space  and  {W^}^  ,  two  sets  of 

linearly  independent  KG  functions  such  that 


<wk'  V  -  °k  \ 


(12) 


Mika  v  '  and  Porsching  '  '  have  attacked  the  problem  rigorously  for  mono- 

energetic  plane  transport  and  multigroup  diffusion,  respectively.  Unfortuna¬ 
tely,  Porsching' s  assumptions  are  unrealistic  for  G  >  1. 

If  we  make  the  assumption  of  an  identical  fission  spectrum  x  for  all  precursors 
,  the  eigenvalue  equation  is 


(oil  +  vK )<t>  *  f(w)v  vftf  (16) 

where  v  is  the  neutron  velocity,  K  and  F  =  JQ  *  J\  the  destruction  and  fission 
operators,  respectively 


A  1 

f  (u>)  ^  1  -  I  X 


.  ,  X.  +  u 

i=l  i 


(17) 


-1 


We  define  the  eigenvalues  v/  [  v^w)]  of  (  ul  +  vK)  vF 


—  0^(u j)  =  (oil  +  vK)  1  vPfi^(u) 


(18) 


The  values  of  (  v .  (<u)  ]  /v  for  u=o  correspond  to  the  lanbda  modes 

A  /  i  Q\ 


(18) 


the 


values  of  u  for  (  (u)  ]  /v 


(18) 


1  correspond  to  the  omega  modes  .  Let  be 
the  eigenvalues  of  matrix  vK.  The  fission  operator  is  proportionnal  to  a  pro¬ 
jector  P  : 


|vx>  <vlf| 

VF  -  ctP  ,  a  =  <vlf|vx>  ,  P  =  Ttqivr 


(19) 


Ihe  characteristic  equation  reads 


Each  matrix  [  a±]  or  [  aj  is  K  x  K  and  [  aj  [  b.  j  is  the  (i,k)  element  of  a 
block-partitioned  matrix  in  blocks . 


Each  principal  minor  of  rank  p  >  K  of  |[  aj  [  k^]  |  is  zero;  therefore  eq.  (20) 
the  characteristic  equation  of  |  [  aj 


[  b^]  |  reads 


X10* 


0 


(21) 


Eq.  (18)  has  K  ei<^n values  ( vi  (u)  ]  /v  ,  which  are  algebraic  functions  of  w  . 

If  K=l,  the  unique  eigenvalue  (  ^(ui)]  /v  is  a  rational  function  of  w. 

Ihe  eigenvalue  problem  (eq.  (16))  is  the  perturbation  of  a  slcwing-dcwn  problem 
by  a  degenerate  operator  vF.  We  can  apply  the  Weins tein-Arcns  za  j  n  theory  : 
let 


W(uj)  A  det  [I  -  vvF(wI  +  vK)"1] 


(22) 


Ihen 


KG  id  —  9, 

w(u)  =  n  - - 

bi“’\ 


(23) 


Where  <jk  and  are  the  eigenvalues  of  vK  and  vK  -  vvF,  respectively.  It  can 
be  _  -Twn  that 


K 

w(u»)  =  n 
i*l 


m 

n 


'i(“k’ 


If  K=1  and  vF,  vK  are  symmetric  matrices  (unrealistic  assumption  for  G  >  1) , 
then  the  zeros  and  poles  of  W(oi)  are  real  and  alternate,  and  the  v,  (u)  are  all 
real.  This  is  the  case  examined  by  Porsching  ' .  We  new  turn  the  delayed 
neutrons  in. 

The  KG(I+1)  eigenvalues  are  divided  into  two  sets  : 

(a)  K(G+I)  eigenvalues  of  eq.  (16)  are  solutions  of 


vi(w) 


Sane  of  these  roots  are  reed. 


P  1  :  (a)  [v7(aj]/v  is  real ,  simple,  bounded,  smaller  than  llv.fwJl/v  , 

" '  1  % 


i  >  1 

(b)  [\)j(u)]/\>  is  monotone  increasing  for  w  > 

(c)  if  N  >  1  eigenvalues  [v.lWj/v  are  real,  where  p  <  N  numbers 
[\>i(o)]/v  <  1, 

then  p  roots  of  eq.  (16)  are  positive  and  N-p  are  negative,  all  in  the 
interval  ('-X^<»J.  Moreover ,  there  are  N  roots  in  each  interval 
-X .)  and  N  in  (w  ,  ~\r),  where  X.  is  the  decay  constant  of  the  i'th 
delay ed-neutr on  precursor. 


In  practice  we  can  always  take  uQ  «  -A^ .  All  statements  about  the  non- 
real  roots  of  eg.  (26)  are  dependent  on  generalization  of  the  Gershgorln 
theorem  and  will  not  be  dealt  with  here. 

(b)  Each  -A^,  i*l...I  is  degenerate  of  order  KG-K. 

However  their  eigenmodes  can  never  be  excited.  Moreover  when  oi,  -X  are  not 
eigenvalues. 

If  we  disregard  the  -A^  eigenvalues,  we  have  G+I  clusters  of  K  eigenvalues. 

How  stiff  are  reactor  kinetics  equations?  The  answer  depends  on  the  neutron 
lifetime  A  in  the  reactor. 

-4 

A  thermal  reactor  with  A  =*5 . 10  s  with  p  =+l  0  has  its  7  eigenvalues  (point  kine¬ 
tics)  between  -4.4  and  2.  A  fast  reactor  with  A  *10”^  s  with  p  »t  0.5  $  has 

4 

its  7  eigenvalues  between  -2.2  10  and  +  1.5.  A  glance  at  figure  (1)  will  shew 
the  spread  of  the  eigenvalues  for  the  fast  reactor  .  For  multigroup- 
multipoint  kinetics  the  spread  can  be  much  higher. 


3.  INTEGRAL  FORMULATION  OF  REACTOR  KINETICS  WITH  SOME  CLASSICAL  INTEGRATION  SCHEMES 


Let  the  solution  of  the  kinetic  system  :  &  *  A(t)y(t)  +  §(t)  be  caS:  into  the 
Integral  fonn^1^  ; 

y(t)  -  T(t)y(o)  +  /  T(t-x)S(x)dx  +  /  [a  ^4*  —  +  T(t— t)M(t) ]  y(x)dx 

o  o 

+  /t[(l-a)  +  T(t-x)N(x)]  y(x)dx  (27) 

where  A(t)  Is  split  into  A(t)=*M(t)+N(t) ,  a  being  a  nunber. 

The  matrix  operator  T(t,x)=iVrl(t)V(x)  ,v(o)»  I  (28) 

and  where  V(t)  is  some  matrix  operator  whose  inverse  is  defined  for  0  $  t  $  h. 

A  large  nunber  of  approximate  integration  schemes  for  multipoint  kinetics  can  be 
derived  from  the  integral  equation  (27)  according  to  the  choice  of  V(t)  (or 
T(t,x),  a  and  M(t)  lor  N(t)]  ,  as  well  as  the  choice  of  an  interpolant  for  y (x) 
which  may  be  different  for  each  integral  on  the  right  hand  side  of  (27)  and  the¬ 
refore  entails  the  splitting  of  A(t) . 

For  instance  let  T(t)  »  ewtI,  applied  to  (27)  with  a  =1,  M(t)=A(t) . 

0(t)  -  ewt  0(o)  +  f0  eu(t“T)  [  -u,)0(x)  +  E  XiCi(x)]dx 

(  (29) 

-X.t  t  -X. (t-x)  6 . 

C±(t)  *  e  1  Ci(o)  +  /  e  1  l-i  0(x)]dx 


if  we  choose 


An  equivalent  formulation  of 


(29)  is 


♦  Ct)  =  e“t*<o)  ♦  /  [dli=S-  -  „l*(T)dT  +|£^i  /[.“«”>  -  e'^(t'T) 

_  A  A  o»+A. 

O  i  O 


X  0  (t)dT 


A,  ■  -A.t 

+  l7$r  ci<°»(e  -®  > 


(30) 


A  few  well  known  methods  stem  from  either  (29)  or  (30)  if  we  apply  Galerkin, 
collocation  or  interpolator  methods. 


1°/  Adler's  choice 


(22) 


is  a)  *  -  —  and  the  lntezpolants  are  constants  : 

0(t)  ■  0,  C.  (t)  “  C. .  An  average  reactivity  P  is  defined  by 

;  -  t  e*w,l.)a t//  e"(h',)d,. 

o  o 

2°/  She  prcnpt  jutp  approximation  is  obtained  ky  Hetrick  if  w  -  -  for*  one 
precursor. 


3°/  Cohen's  choice ^ 24 ^  isu>*  0  ^  6  with  an  interpolant  assuming  A(t)y(t)  cons¬ 
tant  over  [  o,h/2]  and  varying  linearly  over  [  h/2,h]  . 

4°/  Brittan ' s  method^ 23 ^  applied  to  (30)  is  u  =o  with  a  linear  interpolant. 

5°/  Kaganove's  method^ 26 ^  applied  to  (30)  is  u  -  ~~^°1  ~8 •  the  reactivity  being 
assumed  quadratic  in  time.  Another  choice  incorporates  an  additional  ex¬ 
ponential  factor  for  the  quadratic  interpolant.  In  the  same  group,  we 

(27) 

have  Vblodka'  s  method  which  assumes  parabolic  behaviour  for  p/A  , 

B/A  and  0. 

(28) 

6°/  Hansen's  method  1  is  obtained  if  we  choose  u>  as  the  largest  eigenvalue 
of  the  inhour  equation  for  a  suitable  average  reactivity,  and  with  interpo- 
lants  s  0(t)  »  0(o)e<llt  and  C^t)  *  C^(o)eut. 


/  29} 

7°/  Kang  and  Hansen's  method  uses  both  equations  (29)  and  (30)  with  w-o 
with  Hermite  interpolants  .  For  further  developments  and 
extensions  see  §  4.3. 

8°/  Taylor  series  methods  are  usually  inefficient  for  stiff  systems.  Vigil 

used  a  process  of  analytic  continuation  by  successive  evaluation  of  the  deri¬ 
vatives.  In  the  constant  reactivity  case  for  example 

N(t)  ■  e(p-6)/,A  t  N(o)  +  /  e{p"6)^A{t"T)I  X^ (k)  (o) Tkdt  (31) 

o 

(k) 

where  the  (o)  are  evaluated  recursively  by  means  of 

C{k+2)  (t)  -  aic|k+1)  (t)  +  bic|k)  (t)  +  dL  Z  A^00  (t)  (32) 

with  a^*A~1(p-6)  -  X^,  bj*A-^(p-8)  X^,  di“A_10i.  Equation  (32)  is  obtained 
after  differentiation  of  (8)  (KOI)  and  elimination  of  T(t)  and  ^  (t) . 

The  drawback  is  usually  the  cunberscme  derivative  handling  if  the  reactivi¬ 
ty  is  not  constant  but  at  least  piecewise  analytic.  However  Vigil's  method 

(34) 

is  one  of  the  rare  instances  where  a  step  control  is  offered.  Seeker 
extended  the  method  by  improving  the  choice  of  the  integrating  factor  to 
minimize  the  remainder  or  maximizing  the  integration  step  size. 


A  straightforward  use  of  Galerkin's  method  is  proposed  by  Fuller,  Meneley  and 
Hetrick for  a  multipoint  model  with  J<,(t)**(t-ti)k/  k*l...n.  The  quadrature 
is  done  exactly,  the  reactivity  being  assured  quadratic  in  time. 

Interesting  numerical  comparisons  between  sere  of  these  older  methods  has  been 
given  by  Szeligowski  . 


4.1.  A-stable  rational  approximation  of  the  exponential 


There  is  no  need  to  justify  here  the  importance  of  A-stability  for  stiff  problems 
fully  treated  elsewhere  in  this  conference.  Some  published  work ( 37-39 J  (2l)  ,on 
numerical  reactor  kinetics,  specifically  use  A-stable  (A-acceptable)  rational 
approximation  of  the  exponential. 

The  problem  of  generating  A-acceptable  functions  has  an  old  history. 

Indeed  the  problem  of  constructing  the  totality  A-acceptable  functions  W(z)  sa¬ 
tisfying  interpolator  conditions  in  |z|  <  1  at  arbitrarily  chosen  points 

was  essentially  solved  for  more  than  a  half  century  by  Schur  and  Nevanlinna 
(40) 


with 


let  us  define  a  sequence  of  functions 
“k  "k-l(z)  "  WKk_1) 


t^(Z)  =  (Z-Zj^)  (1-Zj^Z)”1 


WQ(z)  —  W(z) 


<v)  5  w 


k^l...n 


(33) 


(34) 


k“v+l,  v+2,...n 


The  real  positive  constants  are  arbitrary  except  for  M  =  $ 

w  (z)  is  the  solution  of  the  problem. 


«2 


Mn  *  1. 


P  2  ;  The  SN  problem  admits  at  least  one  solution  if  one  of  the  two  situations 


occurs  : 

1. 

<Mk> 

fe*  2  •  •  •  u  j 

\w(u)\ 

1  u*  V 

—  u  .  —  y 

Vl  J  Vi  “  -  n 

2.  \V(£-V\ 

<Mk> 

k=l*..n  • 

In  the  first  case,  the  solution  is  unique ,  with  \W(z)\  £  M  for  |z|  $  1  (33),  and 
W(z.)  =  W(.o),  i=l...n. 

t* 

In  the  second  case,  the  solution  is  not  unique  :  any  initial  function  with 

\Wn(z)\  s  Mn  on  \z\  -  1  is  suitable  and  yields  a  solution  W(z)  satisfying  (3S). 
If  Wn(z)  is  rational,  then  W(z)  will  be  rational. 

Property  2  Is  a  slight  extension  of  a  property  proven  by  SN^4^  . 

P  3  :  Given  { z and  [wi°^  }  ,  i=l,..,n  there  is  a  minimum  value  of  M  to 

which  corresponds  a  unique  solution  of  SN  problem.  The  algebraic  equation  for 
the  determination  of  the  smallest  M  is  \rfn  ^1  -  \W  (z  )\  =  M. 

We  report  below  scroe  unpublished  work  on  this  subject.  See  however  ref.  (21) . 

P  4  :  A  sufficient  condition  that  there  exists  a  function  W^fz)  of  modulus  $  M 
on  |z|  =1  which  takes  on  the  values  at  z=z^  is  that  are  the  values 

taken  at  z.  by  a  bounded  analytic  function  o(z),  with  |o(zl|  $  N  along  |z|  =  1 

Let  z  *  u-a/u+a,  a  real  <  o  be  the  conformal  mapping  of  the  complex  left  half 
plane  Reu  $  o  an  the  closed  unit  disk  |z|  *  1.  Our  original  problem,  i.e.  to 
find  A-accep table  approximations  Wn(z)  to  o(z)  =  e~a(z+l/z-l)  eu  at  a  finite 
number  of  arbitrarily  given  points  (z. )”.  o (z)  is  a  bounded  function  in  |z|  <1 
but  has  an  essential  singularity  at  z  »  1  and  o  (e  )  *  1  almost  everywhere. 


Ibis  is  no  restriction  to  P  2.  We  define  functions  W(z)  with  property  M  i: 


W(z)W(l/z)  -  M2 


(36) 


Analytic  function  in  |z|  $  1  satisfy  property  M  if  they  have  a  constant  modulus 
on  |z|  *  1. 

P  5  .*  Let  W(z)  be  a  rational  flotation  of  z  with  \W(e  )\  $  M  which  interpolates 
a  n  arbitrarily  given  points  { a.},  i=l...n  inside  the  unit  circle  a  function 
o(z)  satisfying  property  N.  Then  W(z)-c(z)  has  also  n  zeros  |z|  >  l 

and  there  is  a  positive  constant  independent  of  M  and  N  such  that 


*  -J-  <  L  |M| 


Returning  to  the  plane  u,  we  see  that  if  to  each  interpolation  point  u^ 

in  Reu  {  o  is  associated  another  interpolation  with  v^. 


P  6  :  If  the  rational  function  W  (z)  satisfies  the  condition  of  P  4,  it  can  be 

O  (yI+1) 

made  to  interpolate  a(z)  at  an  additional  point  provided  vn^zn+^  =  an 

*th  VW  s  V 

If  the  equality  applies  and  M *  'Mn*  Vnz^  aonBtant  an^  VQ^Z^  t^te 
unique  function  of  least  maximum  modulus  in  Reu  $  0. 


This  additional  point  has  no  counterpart  :  we  have  a  total  of  2n+l  interpolation 
points.  In  practice  we  have  to  interpolate  eu  with  vrtu  at  the  points  u^-h*^ 
when  h  is  the  time  step.  When  h  -*■  o,  the  interpolation  points  move  towards  the 
origin  and  therefore  the  mininun  value  of  M  given  by  P  3  is  a  function  of  h  with 


(h)*l.  Let  now  W(z)  interpolate  o(z)  with  |W(z)  |  <  M  on  | 
Mmin^h)  *  at  2n+1  points  z^»  (hxk-a)  /  (hX^+a) . 


z 


■  1  with 


P  7  :  For  h  sufficiently  smallt  \ W(z)-a(z) |  <  c  h  |n  (X-X  J|  with  u=Xh. 
When  \  .—Of  W(z(u))  is  the  diagonal  Padi  approximation  to1  ez . 

I 


The  general  first-order  (W^  :  first  degree  of  numerator  and  denominator  appro- 
ximant  of  e11)  is 


t: 

I 

j 

s 

* 


n 


W1i(u!u1,u2,M2)=  m2. 


U  U 

u(e  1  +  WA)  +  u2(e  1  -  W2) 
uCM2  +  W2e  l)  +  u^M2  -  W^**1) 


(38) 


with 


W.-M2  5 


+  U, 


U1  "2 
e  -  e 


"2'U‘  eUl^  -  M2 


(39) 


Since  eu  =  ea  eu-a,  and  since  -u.  is  another  interpolation  point,  it  is  easily 
checked  that  ft^tulu^Uj,^)  =  ea  W11(u-a|u1-a,u2-a;  1)  (40)  interpolates  eu  at 
*  2a  -  u2. 

However  ft,  2  may  not  be  A-acceptable  for  all  values  of  a. 

The  second  order  W„  can  be  written  readily  by  means  of  (33,34)  but  the  reduc- 

^  (4i\ 

tion  of  well-known  approximations  for  instance  Liniger-Willou^iby  and  N^rsett 
^42^  to  is  no  easy  task^2^  . 


4.2.  Inplicit  Runge-Kutta  with  spectral  matching 


In  principle,  spectral  matching  looses  its  appeal  for  variable  operators.  Howe¬ 
ver  nunerical  experiments  show  that  spectral  matching  improves  accuracy  for 
reactor  kinetics  problems  provided  the  reactivity  is  slowly  varying.  Inplidt 
R-K  methods  yield  growth  functions  that  may  be  identified  for  instance  to  one 
A-acceptable  function  of  §  4.4.  Gear  has  shown  how  to  build  a  R-K  procedure 
associated  to  a  given  growth  function.  We  start  front  2-stage  procedure  : 


Let  8  »  I a± j [  . 


Point  Kinetics  Reactivity  Step  p*  0.5/MPN1)  FR 
P  ' 

Numerical  evaluation  of  the  local  truncation  error  as 

function  of  time  step  for  various  11,1 )-  type  algorithms. 

The  two  ordinates  represent  respectively  : 
in _ 


I /  p  tog  |gE~  I  ,  10*P 

I  ®E  I 

2/l^jh|, i  «  1, .  ..,7  with  u>j  solution 
of  the  inhour  equation 


\ 


Let  p  -  Tr  3 ,  A  »  det  a.  Then  the  growth  function 


is 


E 

9 


1  +  utb.-Hb^-p)  +  u2  (A-p  (b.-*,)  +  (b.c.+b.c,,)) 

W(i*hA)  * - — - « - — - 1  ■=— =-c—  (41) 

1  -  Up  +  u^A 

where  a  is  easily  expressed  In  terra  of  Cj,c2,A#p .  Identifying  W(u)  with  one 
of  the  A-acceptable  approximations  of  eu  of  type  (2,2) ,  we  have  4  equations 
for  6  unknowns  and  we  are  left  with  a  two  parameters  family  of  R-K  schemes 
which  allows  for  Interpolation  at  the  origin  and  two  arbitrary  points  X2 
In  ReX  <  0. 


The  maximum  order  p=4  will  be  preserved  if 

I  b^^  -  +  0(h4"*)  ,  k=o,l,2 

I  r  b  a. .c.  =  1/6  +  0(h2) 

1  j  1  13  3 


(42] 


(43) 

as  well  as  an  additional  condition  given  in  ref. 

If  we  choose  for  W(u),  the  N^rsett  function  T2(a,b,u),  a  special  case  of 

W2,2  : 

T,(a,b,u)  .  1  *  y2(l-V2)u  +  yailjbW 
1  -  1/2  (l+a/2)u  +  1/8  ( 1+fcH-a)  u 


with 

4  tafUj)  -  a  (ux)  ] 

a  *  ui  a^u2)  -u2  <s(ui) 


1  +  b 


4(Uj  -  ux) 

Uj  o^)  -  u1  ojuj) 


a  (u)  »  u2  ( l-eu)  (  (2+u)e"u  +  u  -  2) 


-1 


(44 


then  the  choice  of  4th  order  Butcher  parameters  : 


bx  «  b2  -  1/2 


;  cl,2 


with 


p  “  1/2  +  a/  4 


A  =  (l+a-*)/8 


yields 


a  -  UjfAj)  +  0(h3) 


b  »  -  3  +  30  1^2  +  * 


(45) 


ij  biaijcj  ’  h  "A“  T  "  1/6  "  OT  xia2  +  0(h3) 


The  last  condition  (42)  is  also  fulfilled  because  a^  differ  from  their  optimal 
valves  by  0(h)  terms. 

Therefore  even  for  variable  matrix  A(t)  ,  at  the  cost  of  two  evaluations  A(c.h) 

5 

and  AtCjh)  at  fixed  times,  we  retain  0(h  )  accuracy  with  spectral  matching  at 
0, X  X2« 

The  solution  of  an  m-stage  implicit  R-K  step  requires  the  solution  of  a  linear 
system  of  mKG(I+l)  unknowns.  Precursor  equations  can  be  solved  one  by  one  and 
we  have  in  fact  mKG  unknowns.  This  may  be  very  expensive  even  for  m«2.  If  the 
a  matrix  is  triangular  we  have  only  m  systems  of  KG  unknowns  :  this  is  in  short 
the  principle  of  factorizable  or  Rosenbrock  type  method^.  We  have  a  growth  func¬ 
tion  with  a  denominator  factorizable  into  real  factors  n  ( 1-a  iihv  x) .  The  re¬ 

maining  degree  of  freedom  can  be  used  for  instance  to  iipise  L-stability  or  to 
minimize  the  error  at  infinity. 

Table  I  gives  the  R-K  parameters  of  various  schemes  associated  with  the  three 
interpolation  points 


4.3.  Collocation  methods 


s 

I 


a 


Starting  fran  the  integral  equation  with  ^=o,  we  can  obtain  various  integration 

schemes  if  we  substitute  in  the  right  hand  side  an  Hermite  interpolation  formula. 

Taking  t*h,  y(x)  is  developped  in  terms  of  basis  functions  which  are  polynomials 

(k)  It) 

and/or  exponentials .  The  only  information  needed  is  y  (o)  and  y  (h) ,  with 
k=»o,  1, . .  .p-1;  l=o,  1 . .  .q-1  obtained  from  the  successive  derivations  of  A(t)  at  t=o 
and  h.  For  instance  for  linearly  varying  operators  (ramp  reactivities) 


A(t)  =  (1  -  £)A(o)  +  £  A(h)  (46) 

the  following  advancement  matrices  (defined  by  y(h)  y(o) )  are  obtained 

by  Hennart(31) 

P11M  !  t 1  ”  §  A(§  h)] ^  [l+^hA(|)J 

(47) 

P12M  5  1 1  -  \  hA(^)  +  ^  A(|)A(h)]_1  [I  +|A(|)] 


More  general  collocation  methods  allowing  spectral  matching  can  be  found  in  ref. 
(30-32)  for  instance. 


4.4.  Extrapolation  and  smoothing 


The  pcwer  of  extrapolation  and  smoothing  techniques  for  stiff  problems  has  been 
exemplified  for  instance  by  Lindberg^^  .  It  has  been  used  in  reactor  kine¬ 
tics  by  Izumi  and  Noda^^  ,  Devooght  and  Mund^31^ ,  Lawrence  and  Doming  , 
Hofer  and  Werner  . 

Smoothing  vector  y  gives 

y(tn,h)  -  j  +  +  ^(tn+i'h^ 


(48) 


where  y(tn,h)  is  the  numerical  solution  at  time  tfi  confuted  with  tire  step  n. 

Then 

y(tn,h)  =  y(tn,h)  +  d^t^hjh2  +  d^t^hjh4  +  wR(h)  (49) 

where  wn(h)  is  a  smoothed  error  term  given  in  ref.  (48) .  Finally 

y(tn,h)  *  |  y(tn,  |)  -  3  y(tn,h)  =  y(tn)  +  0(h4)  +  wR(h)  (50) 

*\, 

Although  there  is  no  guaranty  that  wn(h)  will  be  snail  enough,  in  practice  it 
proved  so. 

Let  us  examine  for  instance  the  results  (46)  of  a  one  dimensional  two-group  bench¬ 
mark  kinetics  problem.  The  reactor  has  3  regions, 120  spatial  mash  points  and  a 
subcritical  transient  is  initiated  by  an  increase  the  thermal  absorption  cross- 

section  linearly  in  one  of  the  outer  regions. Results  of  code  WIGLE-40^61^  have  been 

2 

smoothed  and  extrapolated.  As  shown  in  figure  (2)  smoothing  yields  0(h  )  error, 

4 

and  extrapolation  alone  failed  to  give  0(h).  However  extrapolation  and  smoothing 

4 

contained  yield  0(h  )  accuracy.  The  drawback  of  smoothing  for  very  large  space- 
dependent  problems  is  the  need  for  increased  memory  storage.  Sane  additional  dis¬ 
cussion  of  extrapolation  is  given  in  §  5.  Other  methods  are  available.  For  instance 

(49) 

the  extrapolation  method  of  Bulirsch-Stoer  and  its  associated  time  step  con¬ 
trol  has  been  implemented  in  the  fast  reactor  kinetics  code  CASSANDRE^5^  . 


E 


n 


E 


io'*  io‘*  icr1  io° 

h(s) 


Fig.  2. a.  Percent  errors  for  ANL  Bench- 
nark  ID.6-A1  at  t  *  1.0  s. 


Fig.  2.b.  Percent  errors  for  ANL  Bench¬ 
mark  ID.6-A1  at  t  *  2.0  s. 
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5.  NUMERICAL  TESTS 


We  shall  give  in  this  paragraph  a  short  overview  of  the  effectiveness  of  various 
integration  schemes  devised  for  reactor  kinetics.  As  we  shall  find  in  §  6  it  is 
usually  sufficient  to  examine  the  point  kinetics  equation  for  comparison  purpo¬ 
ses.  The  variable  parameter  is  the  reactivity  p  (t)  and  three  standard  problems 
are  examined  :  step,  linear  ramp  and  periodic  reactivities.  Because  stiffness 
in  noticeably  increasing  as  the  neutron  life  time  A  is  decreasing,  both  thermal 
and  fast  reactors  must  be  treated  separately.  Accuracy  is  reported  in  the  fol¬ 
lowing  way  :  the  "exact"  flux  4>£(t)  is  obtained  with  the  best  integration  method 
applied  on  the  finest  mesh  compatible  with  reasonable  round-off  errors  in  single 
precision  on  a  CDC  6600  computer;  the  approximate  flux  $A( t,h) ,  the  number  of 
exact  digits  is  obtained  from  |#E(t)-0A(t,h)/0E(t)  |  *  10”11  and  reported  either  as 
a  function  of  time  step  h  or  as  a  function  of  arithmetic  operations. 

Spectral  matching,  when  needed,  results  from  interpolation  on  one,  two  or  three 
roots  of  the  point  kinetics  (KG=1)  characteristic  equation  (so  colled  "inhour 
equation")  obtained  from  (16) ,  K  and  P  being  scalar  : 

0i 

p  =  oiA  +  u >E  (51) 

i  V* 

Figure  (1)  reports  first  typical  eigenvalues  u>.h  with  time  step  10~^(s)s  h  $  101(s) 

-7  1 

for  a  fast  reactor  (A® 10  (s) )  for  a  step  reactivity  p  ®0.5  S.  The  local  trunca¬ 

tion  error  is  given  as  a  function  of  h  for  Pad§  approximants  of  the  exponential 
Pqi#  Pu,  for  Chdayschevapproximant  Ch^,  for  (1,7)  and  (1,7,5)  defined  by 
eq.  (38)  (40)  and  which  interpolate  respectively  at  and  The  effec¬ 

tiveness  of  spectral  matching  is  apparent;  it  is  essential  however  to  interpolate 
at  the  largest  eigenvalue  ] o>7] ,  which  amounts  to  impose  L-stability. 

Four  types  of  approximations  are  examined  ;  implicit  R-K  methods,  factorizable 
R-K  methods,  collocation  method  P^M  and  extrapolation  method  (without  smoothing) . 

Figure  (3)  compares  best  methods  for  a  1  i ramp  in  fast  reactor,  for  the  same 
number  of  arithmetic  operations.  The  superiority  of  Pl2M  stems  fran  the  fact  that 
for  stiff  problems  L-stable  are  preferable  and  that  P^^  approximants  are  devised 
for  linear  ramps. 


Figure  (4)  compares  iirplicit  R-K  schemes  for  a  reactivity  raip  in  a  thermal  reac¬ 
tor.  Stiffness  is  less  a  problem  :  L-stability  can  be  exchanged  at  profit  for 
the  higher  accuracy  of  Gauss  R-K  scheme. 

Figure  (5)  conpare  best  methods  in  each  category  for  periodic  reactivity  in  a  fast 

reactor  :  the  L-s table  factor! zable  Radau  I. A  scheme  fares  better.  Gauss  scheme 

4  -3 

accuracy  attains  its  asynptotic  0(h  )  only  for  h  <  5  10  . 

He  may  draw  tar  experience  cbtain«d(S0)  <E-  Mund'  f«sonal  oonrrunicatiaO  Qf 
figures  3,4,  5  are  a  snail  sanple  some  general  conclusions  : 

(1)  the  theoretical  order  O(h^)  accuracy  is  not  a  good  measure  of  accuracy  :  the 
value  of  p  may  be  obtained  in  some  cases  for  very  small  values  of  h. 

Sometimes  the  error  has  almost  a  step  behaviour  as  h  decreases  and  assessing 
accuracy  just  by  halving  the  time  step  can  be  very  much  misleading. 

(2)  L-stability  is  an  important  requirement  for  fast-reactor  kinetics. 

(3)  The  choice  of  the  interpolation  points,  even  the  dominant  one  is  not  very 
critical.  Moreover  it  is  good  enough  to  choose  the  dominant  root  of  the 
point  kinetics  equation  even  for  a  multipoint  or  fully  space  dependent  problem. 

(4)  For  multipoint-multigrovp  problems  at  least,  factor izabi li ty  is  an  important 
requirement. 

If  the  algorithm  should  be  robust  with  respect  to  changes  in  reactivity,  Makinson 
scheme  and  Radau  fare  best. 


Numerical  evaluation  of  the  truncation  error  at  0.5s  for  various  Implicit  RK  methods 


We  shall  not  deal  here  with  this  important  subject  but  uniquely  with  sene  parti¬ 
cular  aspects  where  stiffness  is  involved.  Basically,  in  the  diffusion  approxi¬ 
mation,  the  techniques  are  those  developped  for  the  integration  of  parabolic 
partial  differential  equations.  Let  us  recall  that  ADI  was  first  developped 
for  nuclear  reactor  statics  and  dynamics.  Most  problems  arise  with  the  direct 
or  iterative  solution  of  the  spatial  equations,  owing  to  the  fact  that  most 
methods  use  Crank-Nicholson  algorithms.  The  reader  may  find  a  recent  review  in 
ref.  (51) .  Obviously  nothing  prevents  to  use  the  methods  devised  for  multipoint 
kinetics.  However,  core  storage  limitations  would  automatically  restrict  useful 
algorithms  to  Rosenbrock  type  with  operators  evaluated  at  the  same  time.  Many 
existing  methods  can  be  cast  in  the  form  of  eq.  (27)  with  very  simple  interpo- 
lants  for  y(t) .  The  choice  of  T(t)  *  ewtI  has  often  been  made  but  a  bad  choice 
of  u  can  easily  worsen  the  results. 

A  very  powerful  method  can  however  simplify  the  space  dependent  kinetics  problem, 

(52)  (53) 

i.e.  the  quasistatic  method  and  its  generalizations  .  In  its  simpler  form, 

(54) 

the  flux  density  is  factored  according  to  eq.  (1) .  Let  the  evolutive  equa¬ 
tion  be  rewritten  as  a  scalar  equation  for  the  neutron  density 

-  BN  +  S  (52) 

where  B  is  an  integredi f ferentlal  operator  which  involves  the  contribution  from 
the  delayed  neutron  precursors.  Then 

-  T~l(t)B*T  -  +  T-1S  (53) 

CZC 

Given  initial  conditions,  we  can  choose  T  at  will  and  find  «|*  from  (53) .  But  the 
reverse  is  not  true.  Let 


<f,g>  ■  /fg  dr  dv  dfl 


(54) 


For  an  arbitrary  function  0  (r,vft,t) 


<0,*>  +  T(t)  <0,  |£>  «  <0,B0>  T(t)  +  <0,S>  (55) 

i.e.  the  "point  kinetics  equation"  but  in  fact  an  identity  which  does  not  bring 
in  any  information  that  is  not  already  in  (53) .  Despite  its  formal  appearance 
(53)  (55)  is  not  a  system  for  unknowns  T(t)  and  0(r,v  n,t) .  We  need  an  additional 
constraint  to  make  the  factorization  unique,  which  is  chosen  as 

<0,<t»  *  a (t)  (56) 

where  a(t)  is  a  given  function  usually  taken  equal  to  1.  The  system  to  be  solved 
is  (53)  (56) .  Then  T(t)  =  <0,N>  .  But  since  T(t)  can  be  chosen  at  will,  the 
solution  of  eq.  (53)  being  unique,  we  cannot  expect  constraint  (56)  to  be  satis¬ 
fied.  Therefore  we  inbecP5*  the  problem  in  a  larger  problem,  by  introducing 
for  instance  in  B  a  free  parameter  y  . 

We  solve  the  non  linear  system  (53)  (56)  for  0  and  parameter  y  .  One  method*55* 

is  to  use  the  Anselone-Sall  algorithm  which  is  a  Newton  iterative  procedure  which 

determines  the  iterates  *  and  y  .  If  we  inpose  y  *1,  i.e.  its  takes  its  physi- 

n  n  n 

cad  value  for  every  n,  then  T(t)  is  not  any  more  arbitrary  but  a  solution  of  the 
point  kinetics  equation  which  becomes  a  natural  offspring  of  the  constraint  (56) . 
We  have  finally  an  iterative  procedure  where  (r ,vSJ, t)  and  Tn(t)  are  determined 
iteratively  with  <0,0n>  *  1  for  every  n. 

Do  we  need  after  all  a  "point  kinetics  equation"  since  we  could  sol\e  (53)  direc¬ 
tly  for  a  given  T(t)  ?  Here  comes  the  stiffness  in. 

For  many  practiced  reactor  problems  the  shape  of  the  flux  does  not  change  much,  in 
general  in  all  cases  where  the  perturbation  of  the  critical  state  is  rather  homo¬ 
geneous.  It  is  usually  sufficient  to  start  with  0Q(r,vft,t)  proportional  to  the 

fundamental  eigenstate  at  time  t=o  and  stop  the  iteration  with  T  (t)  1  General  1  - 

o 

zed  quasi  static  methods  have  shewn  in  some  fast  reactor  transients  the  need  to 
iterate  a  few  times. 


Constraint  (56)  inposes  that  some  phase  space  average  of  4»  be  constant  :  as  long 
as  the  shape  of  the  flux  does  not  change  too  rapidly,  the  rapid  changes  of  the 
flux  are  transfered  to  T(t)  and  therefore  most  of  the  stiff  nature  of  the  problem 
is  concentrated  in  the  simpler  point  kinetics  equation.  It  means  that  two  diffe¬ 
rent  time  scales  can  be  used  :  small  time  stejs for  point  kinetics,  i.e.  T  and 
large  time  steps  for  the  shape  function  ii  .  However  for  reactor  transients  that 
are  strongly  space  dependent  like  asymmetric  channel  plugging  in  fast  reac¬ 
tors,  quasistatic  methods  might  lose  their  usefulness. 
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1.  INTRODUCTION 


This  paper  describes  the  evolution  of  an  automatic  ordinary  differential 
equation  (O.D.E.)  solver  in  an  industrial  environment.  There  are  two 
primary  purposes  to  this  paper.  The  first  is  to  describe  the  initial  selec¬ 
tion  and  subsequent  modifications  of  the  software.  The  second  is  to  show 
that  the  selection  and  modifications  were  in  direct  response  to  the  environ¬ 
ment,  i.e.,  to  the  types  of  problems  to  be  solved,  the  staff,  the  hardware, 
and  the  interface  requirements.  Although  this  paper  is  devoted  to  one 
particular  environment,  it  discusses  several  generic  considerations  which 
are  undoubtedly  applicable  to  other  industrial  installations.  Consequently, 
the  issues  we  discuss  should  be  representative  of  those  encountered  by  other 
industrial  users  of  automatic  mathematical  software. 

The  software  topics  discussed  are  of  a  general  nature.  They  should  be 
of  interest  to  the  developers  of  other  types  of  complex  mathematical  soft¬ 
ware  intended  for  use  in  a  production  environment.  In  particular,  software 
for  optimization,  the  solution  of  nonlinear  equations  and,  of  course,  method 
of  lines  share  many  of  the  same  characteristics.  For  example,  both  optimiza¬ 
tion  and  nonlinear  equation  codes  require  evaluation  of  a  user  supplied 
function,  possibly  formation  and  processing  of  a  Jacobian  that  may  be  either 
large  and  sparse  or  small  and  dense.  Many  of  the  points  discussed  are  also 
applicable  to  other  types  of  software  (e.g.,  quadrature  or  linear  algebra 
routines),  that  is,  all  mathematical  software  should  conform  to  accepted 
software  engineering  principles  which  include  adequate  documentation  and 
diagnostics  for  the  user. 


While  we  are  primarily  interested  in  the  software,  we  must  first 
describe  the  environment  in  order  to  understand  the  evolution  of  the  soft¬ 
ware.  This  description  has  several  facets.  It  includes  a  discussion  of  a 
typical  user  —  his  general  background  and  training,  his  specific  mathematical 
and  numerical  analysis  training,  and  the  difficulties  he  encounters  in  using 
automatic  software.  Also  included  is  a  discussion  of  the  manner  in  which  the 
user  interfaces  with  the  mathematicians  and  scientific  applications  programmers 
Related  to  this,  the  computer  hardware  configuration  is  briefly  described. 
Following  the  description  of  the  environment,  the  types  of  problems  which 
must  be  solved  numerically  on  a  routine  basis  are  discussed.  This  discussion 
highlights  problem  size,  execution  time,  reliability  requirements,  and  mathe¬ 
matical  peculiarities  which  cause  difficulties  for  automatic  software.  These 
problem  characteristics  demonstrate  why  software  designed  to  accommodate  stiff¬ 
ness  can  be  very  effective  in  this  context.  Furthermore,  they  provide  useful 
examples  as  to  why  techniques  incorporated  in  specialized  software  (e.g., 
root-finding,  automatic  method  switching,  coupled  differential-algebraic 
systems,  etc.)  are  required  in  software  intended  for  serious  use  in  an 
industrial  environment.  The  material  is  presented  In  chronological  order, 
thus  emphasizing  that  the  development  of  the  software  was  in  direct  response 
to  the  requirements  of  the  problems.  It  clearly  demonstrates  that  the  capa¬ 
bilities  which  we  discuss  are  required  to  effectively  use  the  software  in  an 


2.  DESCRIPTION  OF  ENVIRONMENT 


The  Nuclear  Power  Generation  Diviaion  of  Babcock  and  Wilcox  has  been 
primarily  concerned  with  the  design  and  analysis  of  nuclear  reactors.  This 
emphasis  has  profoundly  affected  the  structure  of  the  organization,  the 
types  of  individuals  employed,  the  computer  hardware  Installed  and,  of 
course,  the  types  of  problems  solved.  This  section  discusses  those  particular 
aspects  of  the  environment  that  have  had  the  greatest  Impact  on  the  evolution 
of  the  O.D.E.  software.  (A  detailed  discussion  may  also  be  found  in  [1].) 

The  bulk  of  the  workload  on  NPGD  computers  consists  of  many  repetitions 
of  programs  which  have  high  execution  times.  Coupled  with  the  quality 
assurance  requirements  Imposed  by  the  Nuclear  Regulatory  Commission,  this 
has  brought  about  a  predominantly  closed  shop  environment  [1].  This  is  a 
closed  shop  not  only  in  the  computer  programming,  but  also  in  the  specifica¬ 
tion  of  the  engineering  problem  and  of  the  numerical  solution.  To  develop  a 
computer  program  in  this  environment  requires  the  successful  interaction  of 
all  three  groups  of  people  (i.e.,  the  engineers,  programmers,  and  mathematicians). 
Let  us  briefly  consider  then  the  individuals  in  each  of  these  three  groups. 

The  Applied  Mathematics  Unit  has  a  full-time  staff  of  six  professionals, 
five  of  whom  bold  Ph.D.'s  in  applied  mathematics.  This  staff  is  augmented 
by  graduate  and  undergraduate  cooperative  students  and  by  consultants  as 
needed.  The  Engineering  Applications  Programming  Unit  is  staffed  by  twenty- 
five  degreed  programmers.  Over  half  of  these  people  hold  masters  degrees 
or  are  actively  pursuing  graduate  work.  These  degrees  are  in  the  areas  of 
pure  mathematics,  physics,  and  various  engineering  disciplines.  Only  one 
degree  is  in  applied  mathematics.  The  Technology  Units  in  the  Engineering 


Department  are  staffed  by  forty-five  engineers,  all  of  whom  hold  masters 
degrees.  Approximately  half  of  the  people  either  hold  Ph.D.'s  or  are 
actively  pursuing  one.  These  degrees  are  in  various  fields  of  engineering 
and  physics. 

While  the  programmers  and  engineers  involved  in  the  development  of 
computer  programs  are  extremely  well-educated  in  their  respective  areas, 
they  lack  a  formal  education  in  numerical  analysis.  Only  about  102  of  the 
individuals  have  had  a  formal  course  in  numerical  analysis.  Another  25Z 
have  had  a  numerical  methods  course  for  engineers  or  physicists.  Thus,  it 
is  very  important  to  realize  that  while  these  groups  are  extremely  well- 
educated  in  other  areas,  they  are,  for  the  most  part,  numerically  unsophisticated. 
We  suspect  that  this  pattern  is  carried  over  to  most  other  installations. 

This  background  should  be  kept  in  mind  by  developers  of  mathematical  soft¬ 
ware,  particularly  in  the  more  complex  areas  such  as  O.D.E.'s. 

Now  let  us  turn  our  attention  to  the  hardware  that  is  used  by  these 
people.  The  NPGD  Computer  Center  includes  interconnected  CDC  CYBER  76, 

CYBER  750,  and  CYBER  720  computers.  The  CYBER  720  and  CYBER  750  process 
all  of  the  interactive  work  and  act  as  a  system  interface  for  all  of  the 
batch  processing.  Most  of  the  scientific  processing  is  done  on  the  CYBER  76 
and  the  CYBER  750.  The  CYBER  76  has  a  27.5  nanosecond  clock  period  with 
64,000  sixty-bit  words  of  small  core  memory,  having  a  220  nanosecond  access 
time  per  word.  It  also  has  available  256,000  sixty-bit  words  of  large  core 
memory,  having  a  687.5  nanosecond  access  time  per  octet  of  words  (i.e., 
total  time  to  access  eight  words  stored  in  consecutive  storage  locations) . 

^These  percentages  are  baaed  on  the  response  to  a  question  pertaining  to 
mathematical  background  that  was  asked  at  the  beginning  of  an  internal 
training  course  on  numerical  O.D.E.'s  taught  by  R.  L.  Brown  and 
M.  A.  Feldsteln. 
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The  CYBER  750  has  a  25  nanosecond  clock  period  with  198,000  sixty-bit  words 
of  memory,  having  a  575  nanosecond  access  time  per  word.  Clearly,  these 
systems  are  relatively  large  and  fast.  These  facts  should  be  kept  in  mind 
during  the  discussion  of  the  difficulties  that  arise  in  terms  of  required 
storage  and  overall  execution  time. 


3.  DESCRIPTION  OF  PROBLEMS 


As  ve  have  seated,  the  evolution  and  development  of  the  software  in 
question  was  in  direct  response  to  requirements  of  the  problems  which  we 
must  solve.  Therefore  it  is  necessary  to  have  an  understanding  of  these 
problems  in  order  to  appreciate  the  software  development.  The 
problems  to  be  solved  are  primarily  related  to  the  design  and  analysis  of 
nuclear  reactors.  As  has  been  discussed,  the  physical  model  is  usually 
developed  by  the  engineers,  although  it  is  becoming  more  frequent  for  this 
to  be  done  in  conjunction  with  the  mathematicians.  It  is  not  uncommon  to 
encounter  problems  involving  time  simulations  ranging  from  a  millisecond 
(Example  3,  below)  to  hundreds  of  seconds  (Example  4),  or  with  solution 
components  ranging  in  magnitude  from  10  ^  to  10^°  in  the  same  problem. 
Furthermore,  it  is  common  to  encounter  problems  which,  in  spite  of  the 
introduction  of  simplifications  in  the  physical  model,  remain  extremelv 
difficult  from  a  numerical  point  of  view.  (Indeed  the  simplifications 
involved  in  a  crude  model  often  render  the  simplified  model  much 
more  difficult  numerically.)  The  software  has  been  used  to  solve  a 
wide  variety  of  such  problems.  Table  1  contains  a  summary  of  several 
representative  problems.  The  problems  described  in  Table  1  are  relatively 
straightforward  in  the  sense  that  they  represent  initial  value  problems 
in  the  standard  form 

y’  -  f(t,y) 

-v<to)  "  y0 
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and  which  do  not  possess  any  noteworthy  special  features  'a.g.,  severe 
discontinuities  in  f).  They  are,  therefore,  amenable  to  solution  by  many 
of  the  available  high-quality  solvers  [2].  In  fact,  several  such  solvers 
[3]  are  routinely  used  to  solve  the  problems  in  this  first  category,  both 
on  a  production  and  a  non-production  basis. 

A  second  category  of  problems  is  of  primary  interest  in  this  paper. 

These  problems  include  those  in  non-standard  form  which  oossess  unusual 
requirements  and  which  are,  therefore,  not  amenable  to  solution  by  pre¬ 
viously  available  standard  software.  The  most  frequently  encountered  sub¬ 
systems  are:  the  equations  of  fluid  flow,  heat  diffusion  in  a  cylinder, 
and  neutron  kinetics  equations.  (In  each  case,  one-dimensionality  and  other 
simplifying  assumptions  are  frequently  introduced.)  Although  the  Individual 
problem  types  are  well-known,  several  difficulties  arise  when  they  are  solved 
simultaneously.  The  computational  difficulties  associated  with  the  solution 
of  these  coupled  systems  are  primarily  related  to:  the  detail  and  number 
of  models  (that  is,  the  number  of  equations),  the  complexity  of  the 
auxiliary  computations  Involved  in  the  calculation  of  derivatives  (that 
is,  the  execution  time),  and  finally,  the  occurrence  of  special  events  which 
radically  alter  the  character  of  the  equation  set  (e.g.,  changing  from  stiff 
to  non-stiff,  changing  the  number  of  equations,  introducing  derivative 
discontinuities).  Although  we  are  well  aware  of  the  difficult  questions 
associated  with  the  solution  of  hyperbolic  and  mixed  systems  of  partial 
differential  equations,  our  primary  concern  here  will  be  with  the  behavior 
of  the  system  of  O.D.E.'s  after  some  discretization  has  been  imposed.  This 
approach  is  fairly  common  [4,60-63]  and  will  allow  us  to  more  adequately 
address  the  software-related  issues  in  question. 
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It  Is  Chis  second  category  of  problems  which  directly  influenced  the 
software  evolution.  Table  2  contains  descriptions  of  four  representative 
problems  from  this  category.  The  descriptions  include  the  special  problem 
requirements  which  dictated  the  software  development  and  other  pertinent 
information  for  each  problem.  Each  of  these  problems  actually  represents 
the  basis  for  a  corresponding  large-scale  software  project.  Therefore,  the 
times  during  which  the  corresponding  project  developments  took  place  is  also 
included.  This  places  in  perspective  the  specific  O.D.E.  software  develop¬ 
ment.  (In  fact,  each  of  these  projects  represents  an  ongoing  effort.  The 
entries  in  the  table  correspond  to  the  times  when  the  bulk  of  the  work  was 
performed.)  Also  included  are  indications  of  the  complexity  of  the  computer 
coding  associated  with  the  calculation  of  system  derivatives.  It  should  be 
noted  that  the  number  of  lines  of  coding  does  not  account  for  the 
additional  use  of  a  large  mathematical  software  library  of  auxiliary  sub¬ 
routines  [3],  For  example,  spline  software,  linear  equation  solvers,  non¬ 
linear  equation  solvers,  and  water  property  tables  from  this  library  are 
extensively  used.  Use  of  these  routines  can  effectively  add  anywhere  from 
a  few  hundred  to  several  thousand  lines  of  additional  coding. 

In  order  to  further  highlight  the  special  problem  requirements 
associated  with  the  examples  described  in  Table  2,  a  more  detailed  descrip¬ 
tion  will  now  be  given  for  each  problem.  The  precise  physical  and  mathe¬ 
matical  details  of  the  problems  are  omitted  for  the  most  part,  since  they 
are  quite  lengthy  and  in  each  case,  they  may  be  found  in  the 
corresponding  references. 
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Example  1  (Core  Reflood  Analysis  [22]) 


The  performance  of  Che  reacCor  coolant  system  during  the  refill  and 
reflood  phases  of  a  postulated  loss-of-coolant  accident  is  simulated  by  this 
example.  At  the  end  of  blowdown  (i.e.,  the  rupturing  of  a  coolant  pipe), 
the  water  level  in  Che  reactor  vessel  is  usually  calculated  to  have  dropped 
below  the  elevation  of  the  bottom  of  the  core,  leaving  the  core  lanersed  in 
saturated  or  superheated  vapor.  As  the  emergency  core  coolant  is  Injected 
into  the  system,  the  water  level  in  the  vessel  lower  plenum  rises  and 
eventually  the  bottom  of  the  core  is  recovered.  A  set  of  one-dimensional 
fluid  flow  equations  coupled  to  a  simplified  heat  equation  is  used  to  model 
this  process.  What  this  translates  to  mathematically  is  a  system  of  100-150 
problem-dependent  0.0. E. 's.  During  the  initial  phase  of  the  transient 
(before  the  water  level  rises  to  the  bottom  of  the  core) ,  the  problem  is 
both  non-stiff  and  relaLJ”ely  easy  to  solve.  When  the  water  initially  comes 
in  contact  with  the  bottom  of  the  core,  additional  equations  are  activated, 
derivative  discontinuities  occur  in  several  components,  and  an  extremely 

rapid  transient  ensues  with  the  accuracy  restrictions  dominating  the 
stability  restrictions.  As  the  core  recovery  proceeds,  a  steady-state 
condition  is  approached  and  the  problem  becomes  mildly  stiff  (stiff, 
oscillatory  in  the  sense  of  Gear  [23]).  Finally,  the  core  flood  tanks  that 
injected  the  emergency  core  coolant  empty  and  must  be  explicitly  turned  off. 

The  first  difficulty  that  arises  for  the  automatic  software  is  precisely 
locating  the  time  at  which  the  water  strikes  the  bottom  of  the  core.  The 
difficulty  is  compounded  by  the  fact  that  this  time  is  a  function  of  the 
solution.  This  point  must  be  properly  located  in  order  to  generate  an 
accurate  solution  and  also  to  avoid  (in  this  case,  excessive)  chattering 
[43,64]  in  the  integrator.  A  second  difficulty  is  associated  with  the 
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Example  2  (Reactor  Pressurizer  [24]) 

Pressurized  water  reactor  systems  normally  use  a  steam  pressurized 
surge  tank  to  maintain  and  control  primary  system  pressure.  This  surge 
tank  Is  commonly  referred  to  as  a  pressurizer.  During  normal  operation, 
the  upper  portion  of  the  vessel  contains  pressurized  steam  and  the  lower 
portion  contains  liquid.  A  decrease  in  the  primary  system's  coolant  pressure 
causes  an  outsurge  of  water  from  the  pressurizer  to  the  primary  system, 
which  compensates  for  any  cooling  or  the  removal  of  the  primary  system 
coolant.  An  increase  in  the  primary  system's  coolant  pressure  causes  an 
insurge  of  primary  coolant  into  the  pressurizer,  which  compensates  for  any 
heating  or  the  injection  of  primary  coolant.  The  pressurizer  is  maintained 
at  a  reasonably  constant  pressure  through  the  use  of  heaters  to  raise  the 
pressure,  sprays  to  lower  the  pressure  by  condensing  steam,  and  relief  valves 
to  rapidly  lower  the  pressure.  The  fluid  model  consists  of  three  reaions 
which  normally  include:  superheated  steam,  saturated  mixture,  and  sub¬ 
cooled  liquid.  Various  combinations  of  these  reaions  are  in  existence  at 
any  given  time  and  a  given  region  can  be  created  or  deleted  in  the  course 
of  a  transient.  The  model  consists  of  O.D.E.'s  derived  from  the  conservation  of 
mass  and  energy  equations  for  the  liquid,  equations  defining  the  rise  of 
the  bubbles,  and  equations  describing  the  behavior  of  the  heaters.  This 
forms  a  system  of  fourteen  ordinary  differential  equations.  There  are  also 
a  large  number  of  auxiliary  computations  related  to  the  interface  con¬ 
ditions  between  the  regions  and  to  condensation. 

The  problem  is  small,  non-stiff  and  for  the  most  part,  relatively  easy 
to  solve.  It  is  included  here  for  several  reasons.  The  first  is  that 
without  an  extremely  accurate  and  consistent  set  of  water  property  tables 
(e.g.,  [19]),  it  is  impractical  to  solve  this  problem  purely  as  a  system  of 
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ordinary  differential  equations.  Rather  it  is  necessary  to  recast  the 
system  into  one  consisting  of  a  mixed  system  of  ordinary  differential  and 
nonlinear  algebraic  equations.  It  is  interesting  to  note  that  several 
different  forms  of  this  system  can  be  written,  most  of  which  are  analytically 
unstable.  The  next  important  feature  is  that  the  sprays,  heaters,  and 
valves  form  a  set  of  nineteen  special  events  that  must  be  correctly  accounted 
for  in  order  to  generate  an  accurate  solution  (each  of  these  events  is  a 
function  of  the  solution).  Therefore,  there  are  more  special  events  than 
O.D.E.'s!  Finally,  it  is  necessary  to  restart  the  integration  following  most 
of  these  special  events  in  such  a  way  as  to  avoid  missing  one  of  the  other 
events.  This  model  is  actually  a  subsystem  that  is  used  in  many  other 
computer  programs,  notably  Example  4. 
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Example  3  (Bubble  Growth  In  a  Time-Dependent  Pressure  Field  [25]) 

The  problem  of  homogeneous  nucleatlon  (bubble  growth)  and  flashing  in 
a  superheated  liquid  undergoing  rapid  decompression  is  illustrated  by  this 
example.  Since  it  is  conceivable  that  a  reactor  coolant  undergoing  rapid 
decompression  during  a  loss-of-coolant  accident  could  become  highly  superheated 
before  flashing,  several  well-known  blowdown  experiments  have  been  performed 
relative  to  this  question.  This  example  represents  a  general  solution 
technique  for  analyzing  the  validity  of  such  experiments  and  toward  the 
inclusion  of  nonequilibrium  flashing  effects  in  the  calculation  of  the 
loadings  on  the  reactor's  Internal  components  during  a  loss-of-coolant 
accident. 

The  process  is  modelled  by  a  system  of  coupled  partial  and  ordinary 
differential  equations  consisting  of  the  heat  conduction  equation  (for  the 
thermal  layer  enveloping  a  vapor  bubble) ,  the  equation  of  motion  (to 
describe  the  growth  of  a  vapor  bubble)  and  a  third  subsystem  (to  describe  the 
state  of  the  surrounding  bulk  liquid).  Since  the  subsystems  are  coupled  at  the 
bubble  surface,  a  moving  boundary  problem  must  be  solved.  Furthermore, 
the  coupling  is  algebraic  in  some  cases.  This  model  has  been  used  to 
correctly  analyze  the  well-known  bubble  growth  measurements  in  a  decreasing 
pressure  field,  bubble  growth  in  constant-pressure  water,  and  novel  chamber- 
and-piston  experiments.  Depending  on  the  problem  to  be  analyzed,  the 
resulting  system  of  O.D.E.'s  contains  20-300  equations.  Initially  the  pro¬ 
blem  is  non-stiff.  It  is  necessary  to  then  determine  precisely  the  time 
at  which  the  bubble  forms  and  begins  to  grow  —  which  is  again  a  function  of 
the  solution.  At  this  point,  the  microscopic  flashing  model  is  activated  — 
resulting  in  a  very  rapid  transient,  followed  by  a  gradual  approach  to 
steady-state  as  pressure  recovery  proceeds. 
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The  first  difficulty  that  arises  for  the  automatic  software  is  the 
determination  of  the  time  corresponding  to  the  onset  of  bulk  nudeation. 

The  next  difficulty  is  related  to  the  algebraic  coupling  at  the  bubble 
surface  which  results  when  the  microscopic  flashing  model  is  activated. 

A  third  difficulty  is  related  to  the  necessity  to  switch  to  stiff  solution 
techniques  to  accommodate  the  mild  stiffness  associated  with  the  gradual 
pressure  recovery.  Sometimes  a  final  difficulty  arises.  The  problem  is 
solved  by  a  method-of -lines  [26]  approach  in  which  any  one  of  several 
transformations  is  used  both  to  immobilize  the  moving  boundary  [26] 
to  scale  the  variables  [27].  Depending  on  the  various  transformations  used 
root-finding  involving  the  old  independent  variable  and  the  new  dependent 
variable  is  required  for  locating  the  output  points  [9]. 


Example  4  (Simulation  of  Upset  Transients  [28]) 

This  problem  simulates  the  performance  of  the  reactor  coolant  system 
during  transients  caused  by  minor  upset  conditions  (e.g.,  loss  of  offsite 
power,  or  loss  of  main  heat-sink).  It  Is  assumed  that  all  of  the  transients 
start  from  steady-state.  Since  all  of  the  transients  are  relatively  slow, 
a  simplified  mathematical  model  is  used  which  consists  of  the  following 
coupled  subsystems: 

(1)  the  one-dimensional  cylindrical  heat  equation 
(11)  a  simplified  set  of  one-dimensional  flow  equations 
(ill)  the  time-dependent  point  kinetics  equations 
(iv)  several  auxiliary  subsystems  (e.  g. ,  Example  3) 

The  total  system  contains  approximately  150  O.D.E.'s  and  is  usually 
considered  stiff  due  to  the  presence  of  subsystem  (ill).  The  system  is  initialized 
by  holding  all  boundary  conditions  constant  and  integrating  until  steady-state  is 
achieved.  Depending  on  the  transient  to  be  simulated,  a  time-dependent 
transient  forcing  function  is  then  activated.  One  common  simulation  involves 
the  step  insertion  of  reactivity  into  the  point  kinetics  equations.  Owing  to 
the  system  coupling,  this  effectively  introduces  a  severe  jump  discontinuity  in 
the  time  derivative  which  is  quickly  transmitted  throughout  the  rest  of  the 
system  —  resulting  radical  change  in  the  nature  of  the  various  sub¬ 

systems  (e.g.,  change.  ...  „he  mode  of  heat  transfer  from  the  fuel  pin  to 
the  reactor  coolant) .  It  is  also  necessary  to  incorporate  approximately  a 
dozen  reactor  trip  functions  in  the  model  (e.g.,  the  system  pressure  too 
high  or  too  low,  the  outlet  temperature  too  high,  or  outlet  flow  rate  too 
low).  Prior  to  the  trip,  the  problem  is  stiff,  but  relatively  easy  to 
solve.  After  trips  occur,  portions  of  the  system  remain  stiff.  However, 
other  portions  undergo  rapid  transitions.  Again  it  is  imperative  to 


precisely  locstc  the  tine  of  the  trips  ia  order  to  generate  an  accurate 
solution.  Finally  as  the  transient  proceeds,  it  again  approaches  steady- 
state.  It  should  also  be  kept  in  mind  that  all  of  the  concerns  outlined 
in  Example  3  are  also  present  here.  In  this  problem,  it  is  crucial  that 
the  stiff  reactivity  component  be  accurately  tracked,  due  to  its  strong 
feedback  into  the  remaining  system  components. 

The  preceding  discussions  by  necessity  are  quite  terse.  Detailed 
discussions  may  be  found  in  the  indicated  references,  from  which  the  above 
discussions  were  excerpted.  However,  it  is  anticipated  that  the  present 
discussions  convey  a  general  understanding  of  the  problem  requirements  which 
guided  the  evolution  and  development  of  the  software  to  be  discussed  in 
the  next  section. 


4.  DESCRIPTION  OF  SOFTWARE 


Ideally,  Che  development  of  Che  software  should  be  presented 
strictly  chronologically  In  order  to  aphasias  Its  evolutionary  nature. 
Unfortunately,  this  becomes  cumbersome  when  discussing  particular  features, 
since  they  were  actually  incorporated  and  refined  over  a  relatively  long 
period  of  time.  In  Table  3  we  have  indicated  the  general  chronology  of  the 
development.  This  again  re-emphasizes  the  fact  that  the  development  of  the 
software  was  evolutionary  and  in  direct  response  to  problem  and  user  require¬ 
ments  (as  may  be  seen  by  comparing  the  dates  In  Tables  2  and  3).  In  addition 
to  the  items  which  are  Included  in  Table  3,  we  also  wish  to  discuss  several 
other  issues.  The  material  to  be  discussed  below  therefore  Includes: 

(1)  the  initial  assessment  and  selection  of  an  integrator 

(2)  the  addition  of  generalized  Adams-Type  methods 

(3)  the  addition  of  the  capability  to  solve  systems  of  coupled 
differential  and  algebraic  equations 

(4)  the  addition  of  root-finding  capabilities 

(5)  the  addition  of  diagnostic  tools 

(6)  user  support 

(7)  additional  software  engineering  considerations 


4.1  Initial  Selection 


The  first  step  in  the  development  consisted  of  an  assessment  of  avail- 
able  software  and  an  initial  selection  from  this  software.  The  results  of 
this  phase  are  reported  in  [2,29-30].  The  specific  software  which  was 
evaluated  consisted  of  Runge-Kutta  software  [11-12],  non-stiff  Adams 
software  [9,13],  stiff  collocation  software  [7-8],  and  several  variants  of 
GEAR  [14,31]  and  EPISODE  [15,17,32].  (Several  other  commercially  available 
solvers  were  also  evaluated,  but  were  excluded  since  they  were  not  com¬ 
petitive  with  the  above  software.)  Fortunately,  several  previous  evaluations 
had  been  performed  elsewhere,  notably  [11,33-38].  These  previous 
evaluations  significantly  influenced  our  assessment.  Subsequently,  several 
of  'the  above  solvers  were  selected  for  inclusion  in  our  Standard  Mathematical 
Subroutine  Library  [3].  As  indicated  by  Table  1,  these  solvers  are  still 
used  in  one  or  more  applications  at  B&W.  Furthermore,  the  capabilities  of 
[13]  have  influenced  the  portion  of  the  development  of  our  software 
related  to  the  addition  of  root-finding.  However,  it  was 
decided  that  in  our  environment  GEAR  and  EPISODE  were  the  most  appropriate 
candidates  for  a  general-purpose  solver.  (On  the  other  hand,  any  reader 
familiar  with  the  other  software  mentioned  in  this  section  will  perceive  our 
debt  to  the  developers  of  the  other  software.  Obviously,  much  of  our 
development  consisted  of  tailoring  and  expanding  their  ideas  so  that  the 
software  could  be  used  in  our  environment.  Without  their  work  to  rely  on, 
the  solution  of  our  problems  would  not  have  been  possible.) 

Due  to  the  sizes  of  the  problems  in  Table  2,  we  found  it  difficult  to 
use  GEAR  and  EPISODE.  (Recall  that  a  large  amount  of  available  computer 
memory  is  usually  required  to  evaluate  system  derivatives  for  our  problems.) 
It  was  also  usually  not  feasible  to  use  their  banded  counterparts  [14-15,17] 


—  sloe*  the  coupling  of  fluid  (In  a  loop),  heat,  and  kinetics  equations 
results  In  a  Jacobian  that  does  not  have  a  banded  structure  (nor  a  nearly 
banded  structure  In  which  elements  outside  a  given  band  may  be  neglected) . 
Therefore,  It  was  decided  to  use  to  sparse  solver  GEARS  [39]  as  the  base 
for  our  general-purpose  solver.  GEARS  consisted  of  GEAR,  but  with- the  linear 
algebra  portion  replaced  by  a  version  of  the  Tale  Sparse  Matrix  Package 
[40].  Therefore,  the  present  software  evolved  specifically  from  [39]. 

(More  recent  versions  of  this  software  are  available  [57],  however,  they 
do  not  address  the  difficulties  encountered  In  our  problems.) 
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4.2  Adams-Type  Methods 

It  was  acted  In  Example  1  that  the  problem  becomes  mildly  stiff  and 
oscillatory  as  the  transient  proceeds.  It  was  found  that  the  third  and 
fourth  order  Adams-type  methods  in  [41],  when  used  with  a  Newton  iteration, 
handle  the  problem  more  efficiently  than  do  the  standard  Adams  methods  or 
backward  differentiation  methods.  Because  of  the  manner  in  which  they 

i 

were  constructed  (as  linear  combinations  of  standard  Adams  methods),  they 
were  also  easily  implemented  into  GEARS. 


The  gain  in  efficiency  over  the  standard  methods  is  due  to  the 
Ao-stabillty  of  the  methods  (absolute  stability  on  the  negative  real  axis) . 

For  problem  1,  the  solver  typically  runs  at  order  3  for  these  methods 
whereas  it  typically  runs  at  order  2  (trapezoidal  rule)  for  the  standard 
Adams  methods.  Although  it  also  runs  at  order  3  for  the  backward  differentia¬ 
tion  methods,  they  have  larger  local  truncation  error  coefficients  than  those 
in  [41].  The  average  step  size  is,  therefore,  smaller.  Overall  a 
reduction  in  execution  time  by  a  factor  of  2-4  is  obtained  for  Example  1  using 
the  Adams-type  methods.  For  the  associated  production  computer  program,  which 
performs  many  long  repetitive  runs,  this  results  in  a  substantial  saving  in  the 
course  of,  say,  one  year. 


4.3  Coupled  Differential-Algebraic  Systems 


Systems  of  the  form 

(1)  y'  -  f(t,y,w),  y(to)  -  yQ 

(2)  0  -  r(t,y,w),  r(t  ,y  ,*  )  -  0  ,  w(t  )  -  v 

O  W  U  O  O 

arise  naturally  in  many  applications.  One  accepted  approach  is  to 
differentiate  (2)  with  respect  to  t  to  obtain  an  initial  value  problem  in 
standard  form.  As  noted  in  Examples  2  and  4,  this  may  not  be  possible 
since  an  explicit  expression  for  (2)  may  not  be  available  and  the  use  of 
finite  differencing  would  be  too  inaccurate.  Another  popular  approach  is 
to  solve  (2)  as  part  of  the  function  (1) .  This  frequently  proves  impractical 
for  several  reasons.  The  first  is  caused  by  the  interaction  of  the  con¬ 
vergence  error  in  the  iteration  with  error  control  in  the  integrator.  If 
the  convergence  tolerance  is  too  small,  the  program's  execution  time  is  high 
(due  to  the  iteration),  while  if  the  tolerance  is  too  large,  the  integrator 
error  test  fails  —  leading  to  prohibitively  small  step  sizes  and  resulting 
in  high  execution  times,  as  well  as  possibly  erroneous  results.  The 
second  drawback  is  that  the  evaluation  of  r  may  require  that  most  of  the 
computations  required  for  the  evaluation  of  f  be  completed.  This  again 
leads  to  an  extremely  high  execution  time  for  the  program.  A  third  draw¬ 
back  that  is  sometimes  overlooked  is  that  questions  related  to  problem 
stability  can  be  masked  by  indirect  solution  techniques  (e.g..  Example  2). 

In  order  to  circumvent  these  difficulties,  the  software  was  modified  to 
directly  solve  these  types  of  systems.  This  is  a  relatively  straightforward 
modification  —  since  a  Newton  iteration  is  already  available  for  the 
solution  of  the  corrector  equations.  All  that  is  required  is  to  modify 
the  usual  iteration  matrix  [42] 
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The  predictor  step  is  performed  in  the  usual  fashion  [42]  for  y  while  an 
algebraic  extrapolation  of  the  same  order  and  based  on  back-points  is  used 
for  w. 

• 

It  is  recognized  that  while  there  does  exist  some  theory  [44-46] 
pertaining  to  the  existence  and  uniqueness  of  solutions  to  (1)  and  (2), 
that  there  does  not,  at  this  time,  exist  adequate  justification  for  the 
numerics.  However,  the  actual  performance  has  been  very  satisfactory. 

The  procedure  has  been  used  to  accurately  solve  problems  that  otherwise 
could  not  have  been  solved.  Furthermore,  the  Integrator  has  failed,  with 
an  error  message,  anytime  we  have  attested  to  solve  a  problem  for  which 
there  does  not  exist  a  solution.  Consequently,  we  have  found  this 
capability  to  be  a  very  useful  one.  For  other  approaches  to  variants  of 
this  problem,  see  [44-46]. 

For  any  readers  who  might  be  Interested  in  also  implementing  such  a 
procedure,  it  is  also  of  interest  to  note  the  need  to  refine  the  algebraic 
values  at  output  points.  This  is  necessary  since  the  interpolated  output 
values  are  based  on  the  predictor.  Hence,  they  do  not  necessarily  satisfy 


4.4  Root-Finding 


Variants  of  this  capability  have  been  previously  uj»«sd  in  other  soft¬ 
ware,  notably  [13,43,47]  (see  also  [10]).  Our  approach  was  most  influenced 
by  that  used  in  [13],  where  the  procedure  is  referred  to  as  g-stops.  This 
terminology  will  also  be  used  throughout  this  section. 

The  overall  idea  is  to  construct  an  auxiliary  function  whose  zeroes 
correspond  to  the  occurrence  of  the  special  event.  For  instance,  consider 
Example  1.  When  the  water  level  in  the  lower  plenum  reaches  the  bottom  of 
the  core,  drastic  changes  in  the  nature  of  the  system  occur.  It  is  necessary 
at  this  point  to  take  special  action,  such  as  restarting  the  Integration. 
Thus,  the  auxiliary  function  would  be  the  height  of  the  bottom  of  the  core 
minus  the  height  of  the  water.  Hence,  the  occurrence  of  this  special  event 
corresponds  to  a  zero  of  the  auxiliary  function.  When  the  integrator 
locates  this  zero,  the  integration  is  then  restarted  with  an  appropriately 
small  step  size  to  accommodate  the  ensuing  rapid  transient.  This  same 
approach  can  be  used  in  Example  2  for  turning  heaters  on  and  off,  or  for 
opening  and  closing  relief  valves. 

In  general,  we  wish  to  locate  the  zeroes  of  the  equations 
0  ■  g1<t,y,y  )  ,  i  *  1, ♦ • • ,k 

simultaneously  with  the  integration  of  the  O.D.E.  In  each  case,  once  an 
interval  is  located  in  which  one  of  the  components  changes  sign,  a  root- 
finder  is  used  to  locate  the  corresponding  zero.  Brent's  one-dimensional 
algorithm  [48-49]  was  modified  to  perform  the  root-finding  in  the  present 


software. 


The  root-finding  is  of  two  types,  interpolatory  and  extrapolatory. 
Interpolatory  g-stops  are  used  when  it  is  possible  to  easily  integrate  past 
the  tiae(s)  corresponding  to  the  special  event  (e.g.,  when  the  event  does 
not  cause  severe  derivative  discontinuities).  For  example,  this  would  be  the 
case  if  the  event  corresponded  to  a  simple  switch  with  an  associated  time 
delay  to  obtain  output,  or  when  it  is  used  to  locate  times  at  which  a  given 
variable  attains  a  particular  value.  ([9]  discusses  several  novel  uses  of 
this  feature.) 

The  g-stop  function  requires  values  for  both  y  and  y'.  These  values 
are  available  from  the  Nordsieck  array  [23,50]  in  the  present  software. 
Basically,  after  the  completion  of  each  successful  step  (i.e.,  following 
the  corrector  iteration),  the  Nordsieck  array  is  extrapolated  backwards 
over  the  interval  corresponding  to  this  step  to  examine  for  sign  changes 
and  to  do  the  root-finding,  if  necessary.  In  this  case,  it  is  not  necessary 

to  restart  the  integrator  unless,  of  course,  the  root  corresponds  to  a 
severe  discontinuity  or  requires  that  other  special  action  be  taken.  There¬ 
fore,  interpolatory  g-stops  are  generally  efficient  since  they  do  not  inter¬ 
fere  with  the  integrator's  order  and  step  size  selection  procedure. 

Extrapolatory  g-stops  are  primarily  intended  for  a  totally  different 
purpose.  Generally,  they  are  used  to  cope  with  severe  derivative 
discontinuities  associated  with  special  events  which  frequently  occur  in 
the  problems  we  must  solve  —  for  example,  when  the  water  reaches  the  bottom 
of  the  reactor  core  in  Example  1.  Experience  with  these  problems  Indicates 
that  several  commonly  recommended  techniques  for  dealing  with  such  dis¬ 
continuities  do  not  work  well  for  us.  For  example,  simply  ignoring  the 
discontinuity  is  usually  disastrous.  If  it  is  severe  enough  (as  in 
Example  1),  the  presence  of  the  discontinuity  will,  at  best,  result  in 


excessive  integrator  chattering,  leading  to  extremely  high  execution  time. 

(More  often  than  not,  a  program  abort  is  encountered.)  Furthermore,  the 
special  event  often  necessitates  the  addition  or  deletion  of  portions  of 
the  model  (e.g.,  the  creation  or  disappearance  of  a  steam  region  in 
Example  2),  making  it  impossible  to  ignore  the  event.  Another  technique 
which  is  sometimes  reconsended  is  to  complete  the  current  step  and  then  allow 
the  changes  corresponding  to  the  special  event  to  be  made.  Depending  on 
the  nature  of  the  problem  prior  to  the  occurrence  of  the  special  event,  the 
step  size  being  used  is  invariably  much  too  large,  resulting  in  either  a  program 
abort  or  inaccurate  results.  This  technique,  therefore,  has  not  worked  well  for 

Extrapolatory  g-stops  correspond  to  using  the  predictor  to  detect  sign 
changes  over  the  next  step.  In  the  present  software,  this  is  accomplished 
by  extrapolating  forward  before  each  integration  step  is  performed  (l.e., 
before  the  corrector  iteration  is  initiated).  Since  only  the  predictor  is 
used  in  this,  the  discontinuity  can  be  located  without  requiring  any 
derivative  evaluations  for  points  past  the  discontinuity.  The  integration 

is  then  restarted  with  the  appropriate  changes  in  order  and  step  size 
necessary  to  "transcend"  the  discontinuity. 

Extrapolatory  g-stops  are  generally  less  accurate  than  interpolatory 
g-stops  since  they  are  located  using  only  the  predictor.  In  fact,  it  is 
possible  to  miss  roots  of  the  auxiliary  function  if  only  extrapolatory  g-stops 
are  used.  This  can  occur,  for  example,  if  the  auxiliary  function  does  not 
change  signs  following  the  prediclton,  but  does  change  signs  following  a 
correction.  In  problems  such  as  Example  4,  it  is  imperative  that  the  times 
corresponding  to  all  trips  and  switches  be  correctly  located,  since  the 
system  response  can  be  drastically  altered  otherwise.  (For  a  simpler 
example  where  this  is  the  case  see  (43].)  Therefore,  in  our  implementation. 


extrapolatory  root-finding  is  automatically  backed-up  by  interpolatory 
root-finding.  While  the  possibility  still  exists  that  a  g-stop  might  be 
missed  (the  corrector  polynomial  simply  may  not  reflect  the  sign  change 
in  the  auxiliary  function),  we  have  not  encountered  a  situation  where  a 
missed  extrapolatory  g-stop  is  not  picked  up  by  the  backup  interpolatory 
calculations.  Furthermore,  the  Increased  reliability  of  the  software  more 
than  adequately  compensates  for  the  slight  increase  in  overhead. 

Related  to  the  use  of  g-stops,  another  issue  that  seems  not  to  have 
received  much  attention  in  some  previous  software  is  that  of  scaling  the 
auxiliary  function.  It  is  usually  necessary  that  the  function  be  scaled 
appropriately  in  order  that  the  root-finder's  error  control  be  compatible 
with  that  of  the  integrator.  While  it  is  possible  to  leave  the  respon¬ 
sibility  for  such  matters  to  the  user,  we  consider  it  preferable  to  auto¬ 
matically  accommodate  this  matter  in  the  software. 


4.5  Diagnostic a 


Since  the  evaluation  of  the  derivatives  may  involve  several  thousand 
lines  of  executable  code  (e.g.,  the  entire  program  for  the  relatively  small 
problem  in  Example  3  comprises  approximately  17,500  lines  of  coding),  it 
is  imperative  that  effective  diagnostic  tools  be  available.  Errors  do 
occur  in  both  the  program  and  the  model  and  these  errors  must  be  located 
in  a  reasonably  efficient  manner.  The  diagnostic  tools  described  here 
evolved  as  the  problems  in  Tables  1  and  2  were  Implemented  and  debugged. 

They  represent  devices  which  were  found  to  be  convenient  and,  in  some  cases, 
indispensible.  While  they  probably  do  not  encompass  all  devices  that  might 
be  useful,  they  do  give  an  Indication  of  what  is  needed.  Our  philosophy  is 

that  the  user  will  Introduce  errors  and  that  the  software  should  actively 
cooperate  in  detecting  these  errors.  This  is  a  departure  from  the  currently 
accepted  approach  which  is  that  the  software  itself  need  only  be  correct. 
Following  is  a  brief  description  of  each  of  several  capabilities  which  are 
included  as  debug  options  in  the  software. 

(i)  Identification  of  troublesome  components  - 

Frequently,  only  a  few  components  are  causing  the  error  test  or 
the  corrector  iteration  to  fail  repeatedly.  An  option  is 
included  to  automatically  identify  these  components  by  using 
a  max  norm  and  printing  the  number  of  the  equation  that  caused 
failure. 

(ii)  Identification  of  oscillatory  components  - 

Model  deficiencies  or  programming  errors  are  often  manifested 
as  oscillatory  components.  Therefore,  an  option  is  included 
to  detect  a  sign  change  in  the  derivative  between  successive 
steps  and  to  print  the  component (a)  for  which  this  occurs. 
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(Hi)  Checking  Che  function  evaluation  - 

Given  the  complexity  of  the  function,  evaluation  of  some 
derivatives  may  not  be  defined,  particularly  during  initial 
debugging.  To  detect  this,  the  y*  vector  is  preset  to  negative 
infinity  (that  negative  number  which  is  largest  in  magnitude  and 
machine  representable)  before  the  evaluation  is  called.  After  the 
function  evaluation,  y'  is  checked;  and  if  any  components  have  not 
been  defined,  the  component  is  identified  and  the  integration  is 
halted.  A  related  problem  is  that,  from  the  use  of  fixed-step, 
first  order  techniques  used  in  many  extant  engineering  codes, 
users  have  grown  accustomed  to  naively  resetting  the  y-vector 
if  it  falls  outside  of  some  desired  physical  range  (e.g.,  re¬ 
setting  of  negative  flow  rates) .  Needless  to  say,  this  usually 
has  a  detrimental  effect  on  the  integrator.  In  order  to  detect 
this,  integer  sums  of  the  y  vector  are  formed  before  and  after 
the  function  evaluation  and  on  exits  and  entry  from  the  integrator. 
If  the  sums  do  not  agree,  an  error  message  is  issued  and  the 
integration  is  halted.  This  procedure  requires  only  four 
additional  storage  locations  and  easily  detects  an  error  that 
otherwise  is  extremely  difficult  to  locate.  (Extreme  paranoia 
resulting  from  consulting  for  some  particularly  difficult  program 
developments  has  led  to  also  checking  all  working  storage  currently 
in  use  by  the  integrator.) 

(iv)  Baiting  divergent  corrector  iterations  - 

It  has  been  observed  that  when  the  Newton  iteration  diverges,  it 
does  so  quite  rapidly.  This  can  lead  to  abnormal  terminations 
in  the  function  evaluation  (e.g.,  a  pressure  out  of  range  in  water 
property  tables).  To  avoid  this,  a  divergent  correction  iteration 
is  immediately  halted. 


(v)  Printing  the  Jacobian  - 


Some  programming  errors,  such  as  the  use  of  variables  from  a 
previous  time  step,  are  easily  detected  if  a  "picture"  of  the 
Jacobian  is  printed.  An  auxiliary  subroutine  is  provided  which 
prints  l»s  for  nonzero  values  of  the  Jacobian  and  0's  for  the 
zero  values. 

(vi)  Statistical  information  related  to  integrator  performance  - 

A  good  deal  of  other  information  regarding  integrator  performance 
is  also  collected,  although  it  is  only  indirectly  available  to 
the  user  (to  whom  the  information  is  usually  of  little  or  no 
interest).  This  information  is  often  of  substantial  interest  to 
the  numerical  analyst,  however .  For  example,  counts  of  the  number 
of  order  Increases  and  decreases  can  give  an  indication  of  order 
cycling,  which  is  often  associated  with  model  errors  or  instabilities. 
Other  useful  information  includes  the  number  of  steps  taken  at 
each  order,  the. number  of  integrator  crashes,  the  number  of  error 
test  failures,  and  the  number  of  corrector  convergence  failures. 

(vii)  Heuristics  - 

Although  this  capability  is  not  available  to  the  user,  it  is 
possible  to  alter  heuristics  that  are  built  into  the  software, 
in  a  very  convenient  manner.  This  includes  forcing  that  a  given 
number  of  corrections  be  performed  at  each  step,  requiring  that 
more  steps  be  taken  between  changes  in  order  and  step  size, 
attempting  to  steer  the  Integrator  toward  a  particular  order, 
biasing  the  order  selection  toward  high  or  low  orders,  changing 
the  strategies  regarding  the  frequency  of  updating  the  Jacobian, 
changing  the  definition  of  weighting  vectors,  switching  to  various 


norms,  and  ocher  miscellaneous  heuristics.  None  of  these  capa¬ 
bilities  may  be  used  to  override  the  adaptive  nature  of  the 
software.  Indeed,  each  Item  usually  degrades  integrator 
efficiency  due  to  the  conservative  nature  of  the  item.  Con¬ 
sequently,  this  capability  is  intended  strictly  for  use  by  the 
numerical  analyst  for  evaluating  relative  merits  of  heuristic^''*''- 
procedures  during  model  analysis  and  related  testing. 

While  most  of  the  above  devices  are  simple  and  relatively  straight¬ 
forward,  they  often  mean  the  difference  between  making  or  missing  a  delivery 
date  (and  hence,  whether  or  not  the  software  will  continue  to  be  used  by 
users  who  are  already  suspicious  of  a  black  box).  They  also 
relieve  some  of  the  burden  of  debugging  a  complicated  program  from  the 
numerical  analyst  by  providing  meaningful  diagnostics  to  the  programner  and 
the  engineer.  As  a  result,  they  increase  the  user's  confidence  in  the 
software  by  overcoming  to  some  extent  his  fear  of  its  black  box  nature,  by 
providing  useful  information  about  problems  that  are  present. 


The  typical  prograaner  or  engineer  described  in  Section  2  is  not 
equipped  to  effectively  utilize  a  piece  of  numerical  software  as  sophisticated 
and  complex  as  an  automatic  Integration  package.  To  ease  this  difficulty, 
simplified  driver  subroutines  that  solve  Initial  value  problems  in  standard 
form  have  been  written.  These  in  turn  preset  many  of  the  parameters  required 
and  then  call  the  standard  integrator.  This  allows  the  user  to  learn  and 
grow  as  the  complexity  of  his  problem  Increases.  At  any  given  time,  he  needs 
only  to  access  the  portions  of  the  Integrator  that  are  required  by  his 
problem.  To  Increase  the  level  of  mathematical  sophistication  of  the  typical 
user,  internal  training  courses  in  numerical  analysis  are  also  offered. 
Finally,  the  diagnostic  capabilities  of  the  integrator  have  been  expanded 
to  provide  meaningful  Information  to  the  user.  In  spite  of  the  fact  that 
this  does  not  eliminate  the  need  for  the  in-depth  involvement  of  an 
experienced  numerical  analyst  in  any  large  development  effort,  it  does 
contribute  to  the  usefulness  of  the  software  by  relieving  the  numerical 
analyst  from  many  of  the  routine  tasks  that  can  be  performed  by  engineers 
and  trained  prograaner s. 


4.7  Additional  Desired  Features 


There  are  several  additional  features  which  would  be  quite  useful  In 
our  environment,  but  which  have  not  yet  been  incorporated  into  the  soft¬ 
ware.  These  features  should  receive  future  attention  by  software  developers. 
([56]  contains  a  similar  sumary.)  They  include  the  following. 

4.7.1  Global  Error  Estimation 

Global  error  estimation  should  be  regarded  as  a  necessity  rather  than 
as  a  luxury.  A  global  error  estimate  should  be  available  in  simulations  of 
problems  such  as  those  discussed  in  this  paper.  Global  error  estimates 
would  not  necessarily  be  included  with  the  intent  that  this  error  be 
controlled  by  the  integrator,  but  rather  to  avoid  accepting  at  face  value 
incorrect  results.  As  an  example,  consider  the  following  apparently  simple 
problem  [51]. 


2  2 
-1  -9  cos  6t  +  12  sln6t  cos6t  12  cos  6t  +  9  sln6t  cos6t 


-12  sln26t  +  9  sln6t  cos6t  -(1  +  9  sin26t  +  12  sln6t  cosft) 


The  function  is  linear,  continuous,  L ipse hits  and  further  the  eigenvalues 
of  the  Jacobian  are  fixed  in  the  left  half  plane  at  -1  and  -10.  Thus, 
on  the  surface  this  seems  to  be  a  very  innocent  problem.  However,  the 


family  of  solutions  is  unstable;  a  fundamental  matrix  for  this  problem  is 


e2t(cos6t  + 
e2t(2cos6t- 


2  sln6t)  e”1  c(sin6t  -  2  cos6t) 


sin6t) 


-13t 

e  (2  sln6t  +  cos6t) 


Then  a  problem  such  as 

y"  -  A(t) (y-s(t))  +  s'(t) 
y(0)  -  s(0) 

where  A(t)  is  the  2x2  matrix  above  and  s(t)  is  any  well-behaved  vector 
function  has  a  solution  Imbedded  in  an  unstable  family.  Note  also  that 
action  based  on  a  knowledge  of  the  eigenvalues  of  the  Jacobian  matrix  for 
this  problem  may  be  disastrous. 

If  a  seemingly  innocent  problem  can  exhibit  such  pathological  behavior, 
any  meaningful  problem  must  be  treated  with  case.  To  our  knowledge  [12] 
is  the  only  widely  available  software  which  generates  an  accurate  global 
error  estimate  for  the  above  problem,  although  other  Integrators  generate 
a  "solution"  to  this  problem.  Further  investigation  of  the  ideas  in  [52-53] 
could  provide  a  useful  means  of  extending  global  error  estimation  techniques 
Co  many  of  the  popular  linear  multi-step  O.D.E.  solvers. 


4.7.2  Jacobian  Matrix 


The  manner  in  which  the  Jacobian  matrix  and  related  ones  are  processed 
should  be  carefully  examined.  For  example,  the  ideas  related  to  the  use 
of  auxiliary  storage  in  [54]  can  be  invaluable  for  problems  such  as 
Examples  1  and  3,  where  derivative  evaluations  are  very  complex.  The  simple 
idea  in  [54]  of  saving  the  Jacobian,  J,  so  that  it  does  not  need  to  be  re¬ 
calculated  when  a  change  in  order  or  step  size  requires  that  the  iteration 
matrix 

P  -  I  -  hgj 

be  reformed  and  decomposed,  would  greatly  enhance  the  attractiveness  of  the 
software  in  an  environment  such  as  ours. 

Alternate  forms  of  the  sparse  matrix  package  [40,58-59]  that  require  less 
storage  should  be  Included  in  the  software.  Also,  on  a  CDC  machine,  it  is 
wasteful  to  use  a  sixty  bit  word  to  store  an  integer  pointer.  The  use  of 
integer  packing  techniques  could  significantly  reduce  the  required  storage. 

Even  though  many  of  our  problems  involve  only  a  few  hundred  equations,  and 
are,  therefore,  small  by  many  standards,  storage  is  usually  a  limiting  factor. 

In  our  opinion,  the  benefits  of  being  able  to  solve  a  larger  problem  more 
than  compensate  for  the  machine  dependencies  Introduced.  Further,  the 
machine  dependencies  and  subsequent  nonconformance  with  desired  software 
engineering  practices  could  be  isolated  to  a  few  functions  or  subroutines. 


4.7.3  General 


Conformance  with  accepted  software  engineering  practices  would 
greatly  ease  the  maintenance  and  modification  tasks.  In  particular,  the 
procedures  for  error  testing,  the  solution  of  the  nonlinear  corrector 
equations,  and  the  selection  of  step  size  and  order  should  be  separate  modules. 
This  would  greatly  facilitate  the  plugging  and  unplugging  of  modules  to  perform 
these  tasks.  Also,  future  upgrades  to  the  software  could  more  easily  be 
incorporated  into  existing  software. 

Documentation  is  a  critical  area  that  needs  to  be  addressed  in  order 
for  the  software  to  be  more  widely  used.  Predominantly,  the  existing 
documentation  is  written  by  numerical  analysts  —  for  other  numerical  analysts. 
However,  as  was  noted  in  Section  2,  the  majority  of  the  users  have  not  had  even 
a  numerical  methods  course.  Thus,  it  is  imperative  that  documentation  be 
available  that  is  written  at  any  elementary  level  and  which  includes  a  large 
number  of  examples  with  detailed  descriptions  of  their  solutions.  As  an 
Indication  of  the  level  of  detail  which  we  have  in  mind  for  adequate  docu¬ 
mentation,  the  following  is  an  outline  of  the  documentation  currently  in 
preparation  [55]  for  the  present  software. 

1.  Introduction 

2.  Discussion  of  software  methodology  (an  elementary  discussion 
of  linear  multi-step  methods  and  related  concepts) 

3.  Discussion  of  the  software  (solution  procedure,  internal 
heuristics,  and  related  concepts) 

4.  Detailed  discussion  of  the  solution  of  several  (10)  related 


problems 


5.  SUMMARY 


This  paper  discussed  the  evolution  of  automatic  O.D.E.  software  in 
a  particular  environment,  which  is  representative  of  many  industrial 
Installations.  Characteristics  of  this  environment  which  affected  the 
software  development  were  described.  These  Included  descriptions  of  the 
typical  user,  the  hardware,  and  the  types  of  problems  to  be  solved.  In 
addition,  several  enhancements  were  discussed.  The  inclusion  of  these 
features  in  future  software  would  be  of  great  benefit  in  an  industrial 
environment.  Other  additional  items  and  capabilities  which  would  be 
useful  in  an  industrial  environment  were  also  identified  and  discussed. 

It  was  explained  why  future  growth  must  be  toward  software  which  is 
user  friendly.  This  requires  complete  and  thorough  documentation  at  a 
relatively  elementary  level  with  many  examples.  Extensive,  comprehensible 
error  diagnostics  are  required.  Although  the  in-depth  involvement  of  an 
experienced  numerical  analyst  will  still  be  required  in  large  development 
efforts,  the  considerations  identified  would  help  to  remove  user  suspicion 
and  reduce  reluctance  to  use  a  black  box.  For  the  software  to  gain  wide¬ 
spread  acceptance,  it  is  critical  that  it  not  present  additional  hurdles  to 
overcome  and  that  it  actively  cooperate  with  the  user. 

One  of  the  compelling  reasons  for  writing  this  paper  was  to  help  to 
bring  about  the  broader  usage  of  standard  mathematical  software,  especially 
in  the  area  of  O.D.E.'s.  Most  Industrial  Installations  lack  both  the 
resources  and  the  interest  to  make  many  of  the  software  enhancements 
discussed  in  this  paper.  Therefore,  the  responsibility  must  be  assumed 
by  the  developers  of  mathematical  software.  The  future  releases  of 


existing  software  can  gain  significantly  wider  acceptance  and  usage  by 
including  these  enhancements.  Consequently,  this  paper  should  provide 
valuable  feedback  to  the  developers  of  mathematical  software  as  to  what 
features  are  needed  by  industry  in  numerical  software.  Hopefully,  the 
approach  taken  here  will  help  to  increase  (and  in  some  cases,  establish) 
lines  of  communication  between  the  software  developers  on  the  one  hand 
and  the  software  users  on  the  other,  resulting  in  broader  and  more 
effective  usage  of  standard  mathematical  software. 
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Applicable 

Software 


TABLE  1.  Summary  of  Representative  Standard  Problem* 


Description  of  Physical 
_ _ Problem 


Coupled  heat  exchanger 
performance  [5-6] 


Interacting  species  — 
point  kinetics  equations 

Structural  mechanics 
modelling  of  reactor  per* 
forma nee  during  earthquake 
conditions. 

Finite-element  models  aris¬ 
ing  in  structural  mechanics 
problems. 

Quench-front  type  problems. 


Analytic  continuation. 
Dependence  of  the  solution 

of  systems  of  non-linear 
equations  depending  on  a 
parameter  [18-19]. 

Analysis  of  reactor  vessel 
internal  loads  [21]. 

Analysis  of  steam 
generator  performance 
[65-66]. 


Mathematical  Problem 
Requirements _ 


Ability  to  switch  from  a 
system  of  non-stiff 
equations  to  a  different 
system  of  stiff  equations. 

Integration  of  small  systems 
of  extremely  stiff  equations. 

Integration  of  rapid  oscilla¬ 
tory  transient  followed  by  a 
long  steady-state  integration. 


Integration  of  large  banded 
systems  of  equation 
(100-2000  equations). 

Integration  of  large  systems 
of  non-stiff  equations  with 
mild  derivative  discontinuities 
(100-500  equations) . 

Ability  U  track  solutions  of 
a  systfc#  oi  equations  con¬ 
taining  »u  imbedded  parameter. 
Simple  root-finding  require¬ 
ments. 

Integration  of  an  extremely 
large  (500-9000  equations) 
system  of  non-stiff  equations. 

Integration  of  large  banded 
extremely  stiff  equations 
(100-1000  equations) .  Root- 
finding  to  locate  phase 
change  boundaries. 


[7-12} 

[7-8] 

[13] 

[14-15] 

[16-17] 

[10,13,20] 

[20] 

[20] 
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TABLE  3.  Chronology  of  the  Software  Development 


Activity 

Initial  assessment  and  selection. 

Assessment  and  testing  of  GEARS. 

Dynamic  dimensioning,  subroutine  standardization, 
and  other  related  modifications. 

Incorporation  of  extrapolatory  root-finding. 

Incorporation  of  Adams-Type  methods. 

Incorporation  of  ability  to  solve  systems  of 
coupled  differential -algebraic  systems. 

Incorporation  of  extensive  diagnostics  and  user- 
oriented  debugging  devices  (e.g.,  auxiliary 
drivers) . 

Incorporation  of  lnterpolatory  root-finding. 

Incorporation  of  refinements  in  the  root-finding 
capabilities. 


Time 

1975- 1976 
1976 

1976- 1977 

1977- 1978 
1978 

1978- 1979 

1979- 1980 

1980- 1981 


1981 
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The  simulation  of  large  engineering  systems  such  as  nuclear  power  plants 
involves  the  numerical  integration  of  a  large  set  of  differential  equations. 
The  conventional  numerical  methods  and  solution  techniques  often  used  in 
these  simulations  can  lead  to  difficulties  due  to  stiffness  that  can  arise 
in  the  system  as  the  transient  unfolds.  A  technique  for  partitioning  the 
system  of  equations  Into  two  separate  classifications  and  methods  for 
integrating  the  variables  in  each  class  are  presented.  The  methods  are 
demonstrated  for  an  auto-pilot  model  and  a  pressurized  water  reactor  model 
of  the  St.  Lucie  Unit  2  PWR.  The  results  indicate  that  these  methods  are 
able  to  yield  considerable  savings  in  computer  CPU  time.  Savings  up  to  as 
high  as  eight-to-one  have  been  observed  over  conventionally  used  methods. 


INTRODUCTION 


The  simulation  of  large  engineering  systems  such  as  power  plants  of  both 
nuclear  and  fossil  type,  aircraft  systems,  etc*,  has  been  an  important 
activity  for  more  than  a  decade.  System  modeling  using  the  conservation 
laws  of  mass,  momentum  and  energy  in  the  case  of  mechanical  and  hydraulic 
systems  and  Kirchoff 's  voltage  and  current  laws  in  the  esse  of  electrical 
systems  is  a  first  step  in  the  simulation  scheme.  These  laws  when  used 
to  model  systems  such  as  pressurized  water  reactors,  for  example,  result 
in  a  set  of  partial  differential  equations.  Using  some  form  of  lumping, 
these  are  reduced  to  a  set  of  coupled  nonlinear  ordinary  differential 
equations.  Conventional  integration  schemes  such  as  Runge-Kutta-Merson, 1 
Episode,2  Gear,2  etc.,  while  generally  satisfactory,  are  highly  unsuitable 
for  some  classes  of  problems.  We  will  direct  our  attention  to  one  such 
class  of  problem  and  refer  to  it  as  two-tlme-scale  and  oscillatory  stiff. 

The  above  class  of  problems  having  highly  oscillatory  solutions  for  a 
small  subset  of  system  variables  requires  small  step  size  for  the  solution 
of  all  system  equations  when  using  the  above  integration  techniques.  Since 
the  number  of  system  variables  that  contribute  to  the  oscillatory  part  of 
the  solution  is  generally  small  compared  to  the  total  number  of  system 
variables,  significant  advantage  could  be  gained  by  partitioning  the 
system  into  (1)  those  variables  which  produce  highly  oscillatory  solutions 
necessitating  a  small  step  size  for  integration  and  (2)  the  remaining 
variables  which  can  be  Integrated  by  using  a  relatively  large  step  size. 

We  define  the  first  subsystem  as  the  fast  subsystem,  the  second  as  the  slow 
subsystem,  the  method  of  identifying  and  splitting  the  system  into  its  slow 
and  fast  component  parts  as  partitioning,  and  the  scheme  of  Integration  of 
such  partitioned  system  as  multlrate  integration.  It  must  be  pointed  out 
that  a  system  which  is  not  stiff  at  the  start  of  integration  could  become 
stiff  during  Integration,  and  that  a  system  which  has  been  partitioned  in 
one  way  may  need  repartitlonlng  during  the  course  of  integration. 

We  present  two  examples  illustrating  the  above  features  of  partitioning 
and  multirate  integration  for  the  cases  of  (a)  a  thirteenth  order  autopilot 
model  and  (b)  a  thirtieth  order  pressurized  water  reactor  system.  Also, 
we  will  present  a  proposed  scheme  for  monitoring  the  stiffness  of  the 
system  to  be  used  as  an  indicator  in  the  partitioning  of  the  system. 


PARTITIONING  OF  SYSTEM  EQUATIONS 

In  this  section  we  will  briefly  review  a  scheme  for  identifying  and  partition 
lng  a  nonlinear  system  of  equations  into  its  fast  and  slow  component  parts. 
The  scheme  for  identifying  the  slow  and  fast  components  of  a  linear  system 
is  due  to  O’Malley  and  Anderson11  and  has  been  extended  to  nonlinear  systems 
as  will  be  presented  below. 

Consider  a  dynamic  system  described  by 


(1) 


Z  -  F(Z,U,t) 

where  Z  end  U  are  n  and  m-dimensional  vectors  respectively,  and  F  denotes  the 
vector  dependence  between  Z  and  U.  In  the  control  literature,  these  are  re¬ 
ferred  to  as  the  state  and  control  vector  respectively.  Around  any  operating 
trajectory  (Zo,  Do),  the  system  equations  could  be  written  as 

z  ■  As  +  Bu  (2) 

where  s  and  u  are  perturbations  around  (Zo,  Do)  and  A  and  B  are  Jacoblans 
with  respect  to  Z  and  D  respectively.  We  tacitly  assume  that  the  system 
equations(l)  can  be  adequately  represented  by  the  linearised  version, 
equatlon(2)  for  the  purposes  of  determining  the  fast  and  slow  parts  of 
the  system.  It  may  be  pointed  out  that  the  homogeneous  part  of  the  solution 
of  equation(2)  is  governed  by  A  while  the  forcing  part  is  dictated  by  B. 

The  total  solution,  therefore,  is  a  summation  of  the  homogeneous  and  its 
forcing  part.  If  u,  the  forcing  function  is  slowly  varying  as  compared 
to  the  fast  components  of  the  homogeneous  solution,  then  we  can  assume  that 
the  fast  and  slow  components  of  the  solution  can  be  Identified  by  an  analysis 
of  A. 

Let  us  assume  that  the  eigenvalues  of  the  system  matrix  A  can  be  separated 
into  two  sets  W  and  S  of  order  Ni  and  N»  respectively,  such  that  eigenvalues 
and  eigenvalues  8j  $S  and  |s.|«  }f  jJ .  We  refer  to  such  systems  as 
two- time-scale;  the  setw  is  the  fast  set  of  eigenvalues  and  the  set  S  is 
the  slow  set.  Later,  we  will  further  assume  that  the  fast  set  also  contains 
at  least  one  complex  conjugate  pair  with  relatively  large  imaginary  parts. 
This  class  of  problems  is  referred  to  as  two-time-scale  and  oscillatory 
stiff.  (The  scheme  presented  below  works  whether  oscillations  are  present 
or  not.) 

We  can  rewrite  equation (2)  as 

*U_‘_  .*12 
*21  !  ^22 

where  z^  and  Zj  are  and  ^  dimensional  subvectors  of  z  and  the  A's  and  B's 
are  submatrices  of  A  and  B,  respectively,  having  proper  dimension.  We  can 
cast  the  A  matrix  into  its  Jordon  form  using  the  eigenvector  matrix  M  and 
its  inverse  Q,  thus, 

J  -  QAM  (4) 

It  can  be  easily  seen  that  if  the  transformation  P  ■  Mz  is  used  then  the 
system  equations  (2)  will  be  completely  decoupled  in  the  new  state  variable 
vector  P  and  the  homogeneous  part  of  the  solution  is  governed  by  the  eigen¬ 
values  in  the  Jordon  form  of  A.  Thus,  while  accomplishing  total  decoupling 
the  above  transformation  requires  the  evaluation  of  the  complete  set  of 
eigenvalues  and  its  eigenvectors.  But  as  Golub  and  Wilkinson  point  out  the 
amallest  eigenvalues  of  a  large  ill-conditioned  matrix  cannot  be  calculated 
accurately  and  hence  their  eigenvectors  are  approximate  at  best.  We  use 
a  slightly  different  approach  which  achieves  partial  decoupling  and  is  all 
that  is  needed  for  partitioning.  Only  a  very  brief  outline  of  the  method 
is  given  below  and  the  reader  may  refer  to  Reference  4  for  details.  We  will 
rewrite  equation  (4)  as  follows 


(5) 


^L°. 

Qn.' 

.Ql?i 

r  ■  1 

j*ll  '  *12 

T  ,  1 

0  1  J, 

1  . 

Q2i  ; 

Q2lj 

1*21  ;  *22 

[M~21  ;  M22 

where  Jj  and  J2  Are  submatrices  of  J  containing  the  fast  and  slow  eigenvalues 
respectively  and  (Mii^l)^  corresponds  to  the  eigenvectors  of  the  smaller 
set  of  ni  eigenvalues  that  has  been  identified  for  partial  decoupling.  Various 
other  submatrices  in  equation  (5)  can  be  easily  identified. 


Let  us  define  the  matrices  L  and  T  by  the  relations 


and 


(6) 

(7) 


We  will  further  define  a  transformation  of  the  (zi ,  state  space  into 

(x,y)T  space  by 


(8) 


Using  the  transformation  defined  by  equation  (8) ,  it  can  be  shown4  that  the 
system  equation  (3)  will  be  transformed  into 
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where 

*u  -  An  -  *12 1  a°> 

*22  "  *22  +  “12 

Note  that  *11  and  A22  contain  the  fast  and  slow  eigenvalues  respectively. 

Also,  it  has  been  shown4  that  L  satisfies  the  algebraic  Rlcatti  equation  (ARE) 

^1  +  A21  "  L*12L  “  A22L  "  0  (11) 


It  may  be  observed  by  looking  at  equation  (9)  that  in  the  (x,y)  state  space 
there  is  partial  decoupling  between  the  fast  and  slow  parts  of  the  system. 

The  evaluation  of  L  needs  only  the  eigenvectors  corresponding  to  the  fast 
n^  eigenvalues.  Even  if  this  L  is  not  accurate,  the  algebraic  matrix  Ricatti 
equation  (11)  could  be  used  to  formulate  an  Iterative  procedure  to  Improve 
its  accuracy.  If  order  reduction  were  the  main  consideration  then  equation 
(9)  could  be  used  towards  this  end  as  has  been  done  by  O'Malley  and  Anderson. 
However,  since  our  aim  is  to  integrate  the  original  nonlinear  system,  we  will 
follow  a  alightly  different  approach  as  discussed  below. 


EXTENSION  TO  NONLINEAR  SYSTEMS 


We  have  assumed  in  the  evaluation  of  the  transformation  matrix  that  the 
linearized  system  represents  adequately  the  original  nonlinear  system. 

This  is  true  at  least  in  a  small  neighborhood  of  the  operating  point 
(2 o»  U0)  ^  the  nonlinearities  are  severe,  otherwise  will  be  valid  over 
a  wider  range.  Thus,  deflnining  a  transformation  of  the  nonlinear  variables 

V  -  T2  (12) 

we  can  write  equation  (1)  as 

V  -  TP  (T”1  V,  U,  t)  (13) 

where  V(tQ)  -  T2(tQ) 

Denoting  the  first  n^  components  of  V  by  X  and  the  remaining  n2  components 
by  Y,  we  have 

VT  -  (X,Y)T 

and  equation  (1),  in  terms  of  X  and  Y  will  become 


G(X.t.Y.D) 

H(Y,t,X,U) 


(14) 


where  X(0)  *  Xq,  Y(0)  *  Yq  and  G  and  H  correspond  to  the  first  n.  and  n~ 
right-hand-side  components  of  the  nonlinear  equation.  Using  the  analogy  of 
the  linear  system  presented  earlier,  we  can  denote  the  X  and  Y  as  fast  and 
slow  variables  respectively.  Once  the  fast  and  slow  subsystems  have  been 
identified,  these  partitioned  subsystems  can  be  Integrated  using  a  multirate 
integration  scheme  as  discussed  in  the  next  section. 


As  mentioned  earlier,  the  nonlinearities  produce  a  continually  changing 
Jacobian  matrix  A  as  the  solution  unfolds.  Two  schemes  for  monitoring  this 
change  have  been  developed.  One  is  a  perturbation  technique  for  estimating 
a  residual  in  terms  of  A  at  a  point  in  time  and  in  terms  of  the  nonlinear 
functions  as  they  change  for  subsequent  times.  When  this  residual  has  grown 
beyond  some  pre-set  value,  the  new  A  may  be  computed  and  a  new  partitioning 
dtermlned  as  needed.  Another  scheme  monitors  the  "zero  block"  in  the  matrix 
T.  Nonlinearities  cause  the  entries  in  this  block  to  drift  away  from  zero, 
and  re-partitioning  can  be  instituted  when  the  norm  of  this  sub-matrix  exceeds 
some  appropriate  value.  These  results  can  be  used  in  devising  an  automated 
re-partitioning  scheme,  and  this  is  a  subject  for  continuing  research. 


MULTIRATE  INTEGRATION 

In  this  section  we  will  consider  that  the  original  system,  equation  (1), 
has  already  been  partitioned  into  its  slow  and  fast  components  as  given 
by  equation  (14)  and  will  present  a  scheme  to  Integrate  the  slow  system  by 
a  relatively  large  step  size  and  the  fast  system  by  small  step  size.  In  the 
discussion  below  an  s-stage  explicit  Runge-Kutta  routine1  will  be  used  to 
illustrate  the  multirate  scheme.  It  may  be  noted  that  many  other  integration 
routines  could  be  used  to  implement  the  multirate  schemes. 


Ve  have  for  the  slow  variable. 


Y  .  «  Y  +  hi.  c  k  ,  S-  c  ■  1 
n  +  1  n  r-1  r  r*  r-1  r 


k.  -  H(Y  ,  t  ,  X  ) 

1  n  n  n 

kr  -  H(Vn  +  hlb^V  tn+,  Xn4.  ) 

r*  r 

•  •  s 


*r  ^ 


(15) 

(J-6) 

(17) 

(18) 


Computing  kr  successively  requires  Xn+Sr ,  hence  we  need  to  integrate  the  fast 
variable.  The  Runge-Kutta-Merson  scheme  has  been  used  in  the  integration  of 
the  fast  variable  and  with  interpolation  of  Y(t)  based  on  values  Yn,  Yn+a.»  •  • 
Yn+ar_i  which  are  computed  successively  using  imbedded  Runge-Rutta  schemes  of 
highest  possible  order6.  Note  that  different  Integration  schemes  could  be  used 
to  integrate  the  two  subsystems. 


For  example,  in  order  to  compute 

k2  -  H  «„  +  ta*.  t 


we  have  to  determine  X  .  .  Since 

formula  n+a2 


^x+a^  (19) 

k^  is  known,  we  can  estimate  using  the  Euler 


Yn  +  a2  h  kx 


and  this  suggests  the  use  of  a  linear  polynomial 

Y(t)  ■  Y  +  (t-t  )  k-  in  the  interval  t  s  ts  t  +  a_h 
n  n  x  n  n  Z 

to  integrate  the  fast  subsystem.  The  resulting  Xa  +  a  can  now  be  used  to 
evaluate  k2  by  equation  (19)  .  Since  k2  is  known,  we  can  improve  the  estimate 
of  Yn+a2  usin8  the  improved  Euler  formula.  Estimate  Yq*^  using  the  linear 
polynomial  between  tQ  +  a 2  and  +  83  as  above.  Construct  a  second  order 
polynomial  based  on  Yn,  Yn+a2*  ^n+«3  to  be  valid  for  tn  +  a2s  t<  tn  V* 

Since  Xjj+a,  Is  known,  the  fast  variable  can  be  integrated  from  ^  +  aj  to 
tn  +  83.  we  can  use  XQ+aq  to  compute  k3«  The  above  scheme  can  be  repeated 
to  continue  the  integration.  Thus,  the  previously  evaluated  kj,  k2,  .  .  . 
ky.i  have  been  used  in  updating  Y^a.*  Yn+a, ,  .  .  .  Yn+ar_i  hence  the 
scheme  is  known  as  imbedded  Runge-Rutta  multirate  Integration.  The  above 
scheme  can  be  used  continuously  to  march  ahead  in  time. 


APPLICATIONS 

We  will  demonstrate  the  usefulness  of  the  scheme  for  two-time-scale  and 
oscillatory  stiff  problems  with  the  help  of  two  examples.  The  first  is  an 
autopilot  model  having  13  differential  equations  while  the  second  is  a 
pressurized  water  reactor  having  30  differential  equations.  In  the  first 
case,  the  system  was  stiff  at  the  start  of  Integration  itself  and  hence 
partitioning  was  done  at  time  t"0.  In  the  second  case,  the  system  became 
stiff  at  time  t»93  sec,  during  the  course  of  integration,  and  hence  the 
system  was  partitioned  at  93  sec.  In  both  cases,  there  was  considerable 
savings  in  computer  time  as  will  be  seen  below. 


AUTO-PILOT  SYSTEM 


The  autopilot  model  is  composed  of  four  major  subsystems,  the  vertical  sensing 
unit,  the  amplifier,  the  servo-motor  along  with  its  pressure  stabilizer,  and 
the  flight  dynamics  of  the  aircraft.  Fig.  1  shows  the  various  subsystems  and 
their  Interconnections.  The  system  equations  and  other  associated  details 
can  be  found  in  reference  7.  It  consists  of  13  differential  equations  and 
a  table  look  up.  The  eigenvalues  of  the  system  are  as  shown  in  Table  1. 

Since  an  analytical  solution  was  not  available  for  this  system,  a  bench  mark 
solution  was  obtained  numerically  by  solving  the  system  with  very  small  values 
of  relative  error  control  parameter.  Three  different  methods,  (a)  Runge-Kutta- 
Merson,  (b)  Gear's  stiffly  stable,  and  (c)  Hlndmarsh's  Episode  were  used  to 
yield  agreement  in  solutions  to  a  four-digit  accuracy.  For  ease  of  reference 
we  will  denote  the  multirate  Integration  scheme  by  Pq  and  the  methods  of  Runge- 
Kutta-Merson,  Gear  and  Hindmarsh  by  P^,  P3  and  Pn  respectively.  This  notation 
corresponds  to  Mj_,  M3,  and  respectively,  used  in  DAREP8  simulation  software 
for  these  three  Integration  methods. 

The  error  criterion  for  all  the  schemes  was  selected  so  that  it  was  not  more 
than  1.0Z  during  the  entire  run.  The  computer  times  involved  for  different 
schemes  are  shown  in  Table  2.  Two  representative  fast  variables  and  two 
representative  slow  variables  from  the  multirate  integration  scheme  are 
shown  in  Fig.  2.  From  the  results  of  Table  2,  it  may  be  concluded  that 
considerable  savings  in  time  could  be  obtained  by  partitioning  and  using 
multirate  schemes  to  integrate  such  partitioned  systems.  One  more  interesting 
conclusion  that  can  be  drawn  from  Table  2  is  that  the  Runge-Kutta-Merson 
method  yields  much  shorter  solution  time  as  compared  to  either  Gear's  stiff 
or  the  Episode  methods.  This  may  be  explained  from  the  fact  that  stiff  routines 
are  good  only  when  the  stiffness  is  caused  by  a  spread  of  real  eigenvalues 
in  a  system  not  having  oscillatory  component  in  its  fast  components. 

PRESSURIZED  WATER  REACTOR  SYSTEM 

The  simulation  of  nuclear  power  reactors  has  gained  more  attention  and  import¬ 
ance  since  the  Three  Mile  Island  accident  at  Harrisburg,  Pennsylvania.  The 
PWR  models  are  based  on  the  conservation  laws  of  mass,  momentum  and  energy. 

These  equations,  together  with  the  equations  of  state  for  water  and  steam, 
give  a  complete  set  of  equations.  The  integration  of  such  systems  may  not 
readily  submit  to  efficient  numerical  treatment  using  readily  available 
routines9,  as  we  will  see  for  this  example. 

The  PWR  model  is  based  on  the  data  for  St.  Lucie  Unit  2,  a  Combustion  Engineering 
plant  in  Florida.  In  building  the  reactor  model,  numerous  assumptions  have 
been  made1 0 .  It  may  be  pointed  out  that  while  some  of  these  assumptions  limit 
the  use  of  the  model  to  small  transients  only,  the  model  should  still  retain 
the  eigenvalue-eigenvector  structure  in  the  neighborhood  of  any  operating 
point  as  compared  to  more  detailed  models.  The  plant  as  modelled  is  a  one- 
loop  representation  of  the  two-loop  plant  and  is  shown  in  Fig.  3.  It  consists 
of  a  core,  a  pressurizer,  a  steam  generator  and  a  recirculating  pump.  The 
core  has  been  modelled  by  a  representative  average  channel,  and  the  primary 
coolant  is  assumed  to  be  under  pressure  such  that  no  boiling  is  allowed  in 
the  model.  The  steam  generator  is  simulated  by  a  representative  U-tube  and 
the  primary  flow  is  divided  into  three  sectors.  The  first  sector  extends 
from  the  inlet  to  the  level  of  the  boiling  interface,  the  second  sector  from 
this  interface  to  the  boiling  Interface  on  the  exit  side,  and  the  third  sector 
from  this  interface  on  the  exit  side  to  the  exit  itself.  Each  sector  is  rep¬ 
resented  by  an  average  temperature  node  and  an  outlet  temperature  node.  No 
boiling  is  permitted  for  the  primary  liquid.  The  non-boiling  region  of  the 


secondary  side  of  the  steam  generator  is  similarly  represented  by  an  outlet 
temperature,  while  the  downcomer  exit  temperature  is  determined  by  treating 
the  downcomer  as  a  tank  problem.  The  steam  production  rate  is  calculated 
by  knowing  the  overall  heat  transfer  coefficient  and  the  difference  between 
the  average  temperature  in  the  second  region  and  the  saturation  temperature. 

The  pressurlzer  has  been  modelled  to  represent  four  states  for  the  water 
in  the  bottom  portion  and  sterna  in  the  top  portion  of  the  pressurlzer.  These 
states  are  (a)  top  superheated,  bottom  subcooled,  (b)  top  superheated,  bottom 
boiling,  (c)  top  condensing,  bottom  subcooled,  and  (d)  top  condensing,  bottom 
boiling.  The  recirculation  pnmp  (centrifugal  type)  has  been  modelled  along 
the  traditional  lines  of  nondimens ional  homologus  curve.  This  treatment  allows 
the  pump  to  traverse  any  of  the  four  possible  quadrants  of  pump  operation. 

The  final  model  consists  of  thirty  differential  equations  and  a  large  number 
of  algebraic  equations  and  table  look  ups  for  the  steam  properties.  A  digital 
simulator  using  these  equations  has  been  constructed  and  maintained  as  an 
input  file  to  the  DARE?  translator.  The  response  of  some  of  the  variables 
for  a  10  percent  step  reduction  in  steam  flow  rate  is  shown  in  Fig.  4.  The 
response  appears  to  be  consistent  with  plant  transient  results  for  similar 
plants . 

Attempts  were  made  to  carryout  the  integration  for  the  above  transient  initiator 
up  to  100  sec.  For  all  of  the  three  methods  of  Integration  mentioned  earlier, 
the  first  93  secs  of  real  time  needed  1  sec  of  computed  CPU  time  for  each  second 
of  real  time.  At  93  sec,  the  system  became  stiff,  and  nearly  40  sec  was  needed 
for  each  second  of  real  time.  The  first  four  largest  eigenvalues  of  the  Jacobian 
at  two  times  (11  sec  and  93  sec)  are  shown  in  Table  3.  It  can  be  seen  that  the 
Imaginary  part  of  the  complex  pair  has  increased  by  25  fold  in  this  interval. 

The  equations  have  been  partitioned  into  fast  and  slow  parts  with  3  and  27 
differential  equations  in  the  fast  and  slow  systems  respectively.  The  three 
variables  that  have  been  selected  for  treatment  as  fast  variables  using  the 
approach  given  above  are  neutron  power,  delayed  neutron  precursors,  and  the 
recirculation  flow  rate.  The  simulation  was  restarted  at  93  sec  using  both 
the  original  and  partitioned  systems.  The  original  system  took  80  sec  of 
CPU  time  for  nearly  2  sec  of  system  time,  while  the  partitioned  system  reached 
15  sec  of  system  time  in  the  same  CPU  time.  Indicating  a  7  fold  saving  of 
CPU  time,  (table  4).  The  relative  error  for  the  most  sensitive  variable 
(neutron  power)  was  less  than  0.002  percent  at  95  sec. 


CONCLUSIONS 

Certain  two-time-scale  problems  can  be  advantageously  integrated  by  splitting 
them  into  fast  and  slow  component  parts  and  using  different  step  sizes  for 
integrating  the  fast  and  slow  subsystems.  Characteristics  of  the  linear  version 
of  the  system  model  were  used  to  identify  the  partitioned  systems.  Two  non¬ 
linear  examples  were  used  to  illustrate  the  scheme.  It  was  pointed  out  that 
a  nonlinear  system  which  was  not  stiff  to  start  with  could  become  stiff  during 
integration,  and  a  pressurized  water  reactor  model  for  which  this  happened 
was  presented.  It  was  shown  that  a  computer-time  savings  of  5-8  times  over 
conventional  integration  schemes  could  be  achieved  by  this  procedure. 
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NOMENCLATURE 


A  Jacobian  aatrix  with  respect  to  X 
Ajj  submatrix  of  A 

ar  constant  in  Runge-Kutta  algorithm 

B  Jacobian  aatrix  with  respect  to  Y 

bfi  constant  in  Runge-Kutta  algorithm 

ei  1th  column  of  identity  matrix 

Eq  local  truncation  error 

ELB  Liquid  height  in  the  pressurlzer 

ELR  steam  height  in  the  pressurlzer 

EMSU  surge  mass  flow  rate  in  and  out  of  the  pressurlzer 
EMSP  spray  water  rate  into  the  pressurlzer 
F  vector  describing  state  variable  relationships 

fl  "fast"  eigenvalue 

I*  lxl  Identity  Matrix 

J^  Jordan  blocks  of  order  n^  and  n2»  respectively 

kj.  approximation  to  slope  in  Runge-Kutta  algorithm 
L  principal  component  of  transformation  matrix  T 

M  matrix  of  eigenvectors  and  generalized  eigenvectors  for  A 
Mij  submatrix  of  M 

n  order  of  the  system 

order  of  "fast"  subsystem 
*2  order  of  "slow"  subsystem 

P  reactor  primary  pressure 

notation  for  integration  methods  used 
PS6  reactor  secondary  pressure 

PWR  reactor  core  power 

Q  matrix  of  the  inverse  of  the  eigenvector  matrix  M 
Qjj  submatrix  of  Q 

coupling  coefficient 
s^  "slow"  eigenvalue 

T  transformation  matrix 

t  time 

average  temperature  in  sector  1  of  the  steam  generator 

Ta2  average  temperature  in  sector  2  of  the  steam  generator 

T^3  average  temperature  in  sector  3  of  the  steam  generator 

Tca  average  temperature  in  core  channel 

Tgci  temperature  of  coolant  at  core  inlet 
T^Co  temperature  of  coolant  at  core  outlet 
Tcri  temperature  of  coolant  at  inlet  of  pressure  vessel 
Tcro  temperature  of  coolant  at  exit  of  pressure  vessel 
Td  temperature  of  secondary  coolant  at  downcomer  exit 
Tfa  average  temperature  of  fuel  in  the  core 
T{)ba  average  temperature  of  water  in  the  nonboiling  length 
TNBO  temperature  of  water  at  boiling  boundary 

Toi  exit  temperature  of  water  in  sector  1  of  the  steam  generator 

Tq2  exit  temperature  of  water  in  sector  2  of  the  steam  generator 

Tqj  exit  temperature  of  water  in  sector  3  of  the  steam  generator 

U  control  vector 

u  displacement  of  control  vector  from  operating  point  U0 
steam  exhausted  from  the  pressurlzer  safety  valves 
Wfir  feed  valve  flow  rate 

Wp  primary  coolant  flow  rate 

Wgp  recirculation  flow  rate 

W8  steam  generation  rate  in  the  steam  generator 

Wg.  steam  flow  rate  to  the  turbine 

Wgp  spray  water  rate  into  the  pressurlzer 

X  transformed  vector  for  the  nonlinear  system,  fast  variables 


*  transformed  vector  for  the  linearized  system,  fast  variables 

Zq  numerical  approximation  to  solution  in  Runge-Kutta  algorithm 

Y  transformed  vector  for  the  nonlinear  system,  slow  variables 

y  transformed  vector  for  the  linearized  system,  slow  variable 

Yn  numerical  approximation  to  solution  in  Runge-Kutta  algorithm 

Z  state  vector 

*  displacement  of  state  vector  from  operating  point  ZB 
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Table  1,  Eigenvalues  of  the  Autopilot  Systea 


Table  3 


rour  l»kc: 

Time 

(sec) 

tiKenvaiu es  or  cnt 

Eigenvalue 

No. 

!  jacoDian  at  xx.u  sec  ana  yj.u  sec. 

Eigenvalue 

Real  Part  ImaKlnarv  Part 

1 

-29.12 

0.0 

11.0 

2 

-100.68 

4.07 

3 

-100.68 

-4.07 

4 

-18.46 

0.0 

1 

-146.99 

0.0 

93.0 

2 

-61.41 

108.49 

3 

-61.41 

-108.49 

4 

-60.76 

0.0 

;A 


Table  4 

Comparison  of  Computation  Times  After  T  ■  93  sec. 

Method _ T(CPP)  /T(REAL) 

Episode  40 

Multirate  7 


Figure  1.  Block  Diagram  Representation  of  the  Autopilot  System 


Figure  2.  Two  Representative  Fast  and  Slow  Variables  of  the  Autopilot  System 
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APPLICATIONS  OF  PERTURBED  JUNCTIONAL  ITERATIONS  TD 
NONLINEAR  STIFF  INITIAL  VALUE  rHEMTCAT.  KINETIC  PROBLEMS 
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Abstract 

Numerical  solution  of  nonlinear  initial-value  problems  by  a  perturbed 
functional  iterative  scheme  are  discussed.  The  algorithm  does  not  linearize 
the  system  and  requires  only  the  diagonal  terms  of  the  Jacobian.  Some  exam¬ 
ples  are  presented. 

1.  Introduction 

Stiff  ODE’s  (ordinary  differential  equations)  are  of  special  significance 
in  chemical  kinetics,  where  decay  of  one  component  of  the  solution  could 
happen  much  faster  than  other  components.  For  numerical  solution  of  these 
problems,  it  is  generally  found  that  some  transient  components  which  are 
negligible  in  comparison  with  other  components,  restrict  the  step  sizes  of 
the  explicit  methods  to  be  very  small  in  order  that  stability  of  the  numeri¬ 
cal  solution  may  be  maintained.  Also,  in  order  to  understand  the  proper 
nature  of  the  solution,  the  transient  effects  of  the  fast  decaying  terms 
should  be  revealed  by  the  numerical  process.  This  poses  a  very  difficult 
problem.  During  the  last  decade,  extensive  research  has  been  performed  by 
mathematicians,  engineers,  and  chemists  to  reveal  the  mechanism  of  stiff 
systems  both  mathematically  and  computationally. 


Most  conventional  explicit  methods  such  as  Euler's  method,  Runge-Kutta 
schemes,  Adams-Bashforth  schemes,  etc.,  require  very  small  step  sizes  so  that 
the  algorithm  may  remain  stable.  Although  some  attempts  have  b%en  made  to 
extend  the  stability  properties  of  explicit  methods  for  special  types  of 
ODE's  [ll]  by  far  the  most  common  technique  for  solving  stiff  systems  numer¬ 
ically  is  the  use  of  implicit  methods  which  requires  the  solution  of  simul¬ 
taneous  equations.  In  nonlinear  cases  Newton-type  methods  also  require  the 
evaluation  of  a  Jacobian,  a  process  that  has  a  high  arithmetic  operation 
count.  These  requirements  can  be  costl  in  both  computer  time  and  storage 
requirements. 

Let  us  now  pose  the  problem,  and  set  up  an  implicit  algorithm  for  the 
solution. 

$£  "  f <u,t)  (1) 

T  T 

where  u  -  (uj  u2  ...  %)  ;  f  -  (fj  f2  ...  fm)  ;  and  u(0)  •  uQ. 

Approximating  du/dt  by  a  backward  Euler  scheme,  we  find 

un+1  -  un  -  At  f(un+1,  tn+1) 
or 

„n+l  -  un  -  At  •  f(un+1,  tn+1)  -  0  (2) 

This  equation  must  be  solved  for  un+l  assuming  that  un  the  value  of  u 
at  the  previous  time  step  is  known.  Then,  we  may  represent  this  equation 

as 

F(un+1)  -  0  (3) 

If  U  ■  un+1,  then  it  becomes 

F(U)  -  0  (4  > 

which  is  a  nonlinear  system.  After  we  have  solved  this  system  for  U,  we 
update  the  system  by  replacing  un  by  the  computed  values  of  U  and 


2 


again  solve  another  nonlinear  system  of  the  form  (4),  etc.  The  process  will 
continue  until  we  reach  the  steady  state  which  is  here  assumed  to  exist. 

At  present,  there  are  several  methods  to  solve  nonlinear  systems  of  the 
form  (4).  They  may  be  arranged  into  two  classes: 

(1)  Methods  which  require  relatively  small  computer  memory  storage  and 
operational  counts  per  iteration  but  have  slow  rates  of  convergence  (e.g.. 
Point  Jacobi,  Point  Gauss-Seidel,  SOR  schemes) 

(li)  Methods  which  require  relatively  large  computer  memory  storage  and 
large  operational  counts  per  iteration  but  demonstrate  fast  rates  of  conver¬ 
gence  (e.g.,  Newton's  method). 

In  [1]  a  functional  iterative  scheme  has  been  developed  to  solve  non¬ 
linear  systems.  It  is  obtained  by  perturbing  nonlinear  Gauss-Seidel  itera¬ 
tions.  The  perturbation  parameters  stabilize  the  algorithm,  control  the 
mode  of  convergence,  and,  in  some  cases,  can  greatly  speed  up  the  rate  of 
convergence.  These  parameters  are  essentially  corrective  factors  for  the 
iterates.  Thus  as  the  iterations  .converge,  the  perturbation  parameters 
are  all  damped  out.  The  present  method  has  been  applied  to  solve  several 
stiff  nonlinear  ODE's.  The  results  seemed  to  be  encouraging  in  comparison 
with  those  obtained  by  conventional  explicit  schemes.  The  primary  objec¬ 
tive  in  this  article  is  to  show  how  to  apply  the  scheme  to  nonlinear 
systems  and  give  some  practical  demonstrations.  A  brief  discussion 
regarding  the  details  of  the  algorithm  and  its  convergence  properties  is 
also  Included. 

2.  The  Algorithm 

Given  a  differential  equation 

h(x,x,t)  -  0  ,  x(t0)  -  x0 

where  x  ■  dx/dt  and  t  is  the  independent  variable,  if  a  difference 
approximation  is  made,  it  is  reduced  to  a  nonlinear  system  of  the  form 


Fj (xj |  X2 »  • « •  >  Xjj)  *  0  j  j  *  •  n  (5) 

This  may  be  expressed  as  F(x)  ■  0,  x  ■  (xj  X2  ...  We  assume  that  a 

solution  exists.  Let  us  express  (5)  as: 

x  -  G0(x)  (6) 

*  ' 

G0t  DCRn  -*•  D  (Rn  ■  real  n-dimensional  space).  Nonlinear  Gauss-Seidel 
\ 

iterations  to  solve  (6)  may  be  expressed  as: 

xk+l  .  G(xk+1,  xk)  (7) 

where  xk  -  (xik  X2k  ...  *nk)T  e  D!  xjk  *  value  of  x^  at  the  kth  level 
of  iteration;  G:  DxDCRnxRn-»-D. 

Let  us  perturb  (7)  and  write: 

xk+1  -  ttk  +  G(xk+1 ,  xk)  (8) 

where  uk  -  Cujk  ui2k  ...  o»nk)T€  Rn  is  the  perturbation  parameter. 

If  the  method  converges,  11m  xk  ■  x*  and  G(x*,  x*)  »  x*,  which  implies; 

k-*"» 

Theorem:  1  A  necessary  condition  so  that  (4)  may  converge  is  that  for 
some  norm 

11m  ||  u)k  ||  -  0  (9) 

k  • 


In  the  element  form  (8)  may  be  expressed  as: 


Xjk+1  -  uijk  +  Gjk+1  »k 


(10) 


where 


Cjk+l.k  .  Gj  (xf1, 


k+i 


*  •  •  • 


k+i  v 
X  ,  X.*, 

j-1  J 


•  vk) 


(11) 
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The  algorithm  has  been  derived  in  details  in  [1]  and  Uj  is  computed  by 
using  the  following  equation: 


G(G*+.,k)-Gk+1. 

oj.  - - * - - - - - 


k+l  ,k 


(12) 


Uli  ]# 

where  G*  *  is  computed  using  (11), 

_  /_k+l,k\  /k+l  k+l  k+i  -k+l ,k  k  *  1 k\ 

Gj\  J  /  GjV  1  *  X2  ’  **•  *  XJ-1*  Gj  ’  Xj+1’  \) 


and 


)  Gk+1,k  ■  — 

i  i 


k+l  k+l 


1  »  *  r 

x2  »  *••  »  xj_i» 


k+l  k+l ,k  k 


t  AJ f  •••  I 


k+l  k 

Thus  in  (12),  for  x^  ,  is  computed  in  terms  of  quantities  known 
a  priori.  The  criterion  for  convergence  is,  if  r.t  some  k, 

max  |uk|  <  e 

i  J 

where  e  is  positive  and  arbitrarily  small. 

3.  Convergence  Analysis 

Given  a  sequence  of  scalars  {a^)»  k  ■  1,  2,  ...  ,  a^  is  called  a 


(13) 


D-element  iff 


lim  a.  a,  ...  a,  ■  0  (14) 

4  ®  ^  ^  K 


It  is  obvious  that  if  Vk  >  K,  |ajJ  -  a  <  1  is  satisfied. 

Given  a  sequence  of  square  matrices  {A^}  of  the  same  type  with  vari¬ 
able  elements,  is  called  a  D-matrix  iff 


lim  A.  A,  ,  . . .  A,  ■  0 
k  K-l  1 


(15) 


I 


5 


Obviously  if  A^  *  A  Vk,  A^  is  a  D-matrix  iff  A  is  a  convergent  matrix. 
A  sufficient  condition  that  A^  is  a  D-matrlx  is,  for  some  norm 

& 

I|aJ|  la<  1  (16 

•  v 

The  necessary  and  sufficient  condition  so  that  A^  is  a  D-matrix  is 

I  k,k-l,  ...  i|  „ 

max  a,  .  <  e  (17 


k  k*i  • •  l 

where  e  is  positive  and  arbitrarily  small  and  a  *  ’  •  an  element 

*|J 

of  the  product  matrix 


*k  ^t-l 


•  •  •  A. 


An  Example:  Let  \\  *  diag  (a.,  alf  bj) 

A2  -  diag  (a2,  o2,  b2),  A3  -  diag  (a3,  b3,  a3) 

A4  -  diag  (a4,  a4,  b4),  etc. 

Let  | a j |  -  a  <  1  Vj  >  N  and  a^'s  and  b^'s  are  bounded  and  chosen  arbitrarily 
and  a  finite  number  of  them  are  greater  than  1  in  absolute  value.  Then 

lira  A  A  ,  ...  A  »  0  (18) 

n-*«»  n  n_1  1 

Thus  each  A^  is  a  D-matrix,  however  none  of  these  matrices  is  a  convergent 


matrix. 


Let  us  consider  an  iterative  scheme 


k+i  k+i  k.  ,  i  -» 

x  ■  G(x  ,  x  ),  k  *  1,  2,  ... 


Let  for  a  given  x*  6  D, 


G(xk+1,  xk)  -  G(x*,  x*)  «  Ak(xk+1  -  x*)  +  Bk(xk  -  x*) 


Both  A^  and  B^  are  matrices  with  variable  elements  which  change  as  k 
changes . 
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G  is  called  a  D-aapping  on  D*D  iff  (I  -  A^"1  exists 

Vk  and  (I  -  A^)-^  is  a  D -matrix. 

The  following  theorem  may  now  be  proved. 

Theorem:  2  In  (8)  if  G  is  a  D-mapping  and  x  ■  x*  €  D,  then  the  scheme  of 

iteration  converges  to  x*  if  lim  |ol  |  ■  Q.  Furthermore,  if 

k-*“  * 

p||(I  -  Ajc)”lBJt|  |  <  1,  Vk  >  K,  x*  is  the  unique  root  in  D  (p(A)  -  spectral 
radius  of  a  matrix  A) . 

Proof : 

xk+1  -  x*  -  wk  +  G(xk+1,  xk)  -  G(x*,  x*) 

-  wk  +  A^x16*1-  x*)  +  Bk(xk  -  x*)  (20) 

Let  -  (I  -  Aj^)"1  and  ^  -  (I  -  A^-1^.  Then  from  (20) 

xk+1  -  x*  -  Ckuk  +  ^(x11  -  x*) 


“  £  ^k+j  ...  E^  (x  -  x*) 

k°  . 

“  (Ek  Ek0+l)  £  (Ek0  Vi  ***  Vi)Cj“ 

J-i 

+  <Ek  •••  Ej+1)cJ"1  +  \  \-i  ••• E.  (*°  -  **> 

Now  lim  |uk|  ■  0  Implies  for  some  k  >  k  +  1,  |oik|  <  e  (e  is  positive 
k  —  '  k 

and  arbitrarily  small).  Thus  each  element  is  arbitrarily  small  in 

absolute  value.  Also,  Ek  being  a  D-matrix 


lim 


*k  Vi 


...  E 


1 

k+1 


and  lim  E.  E.  .  ...  E.  .  -  0.  Hence,  |x  -  x*|  <  e  which  establishes 

a  K  a—  1  1  ” 


k  -*■  ® 
convergence 


To  prove  uniqueness  we  assume  y*  s  D  is  another  root.  Then 
x*  -  y*  -  G(x*.  y*)  -  G(y*,  y*) 
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This  gives 


x*  -  y*  -  E^(x*  -  y*) 

where  ^lim  -  E*.  Thus,  (I  -  |eJ)|x*  -  y*|  i  0.  Since  pCfE^j )  <  1  Vk  >  R, 
p(|s  I)  <  1.  Hence,  by  Neumann's  lemma,  (I  -  |E  |)_1  exists  and  is  non- 
negative.  Hence  |x*  -  y*|  -  0  which  implies  x*  ■  y*. 

More  discussions  on  D-matrices  and  D-mapr 'ngs  are  given  in  [15]. 

A.  Some  Comparison  with  Other  Methods 

Implicit  methods  for  solving  differential  equations  usually  demonstrate 
better  stability  properties  than  most  explicit  algorithms.  When  a  system  of 
nonlinear  differential  equations  is  reduced  to  a  nonlinear  difference  system, 
various  iterative  schemes  are  available  for  numerical  solution.  The  simplest 
methods  involve  functional  iterative  schemes  (Jacobi  or  Causs-Seidel  itera¬ 
tions)  which  have  generally  very  slow  rates  of  convergence.  Newton's  method 
has  a  quadratic  rate  of  convergence  which  may  be  expressed  as: 

||xk+1  -  x*  ||  -  a|[xk  -  x*||2 

This  Immediately  implies  that  even  if  0  <  a  <  1,  if  ||  x°  -  x*  ||  >1,  the 
method  could  fall.  Thus  one  basic  requirement  is  that  the  initial  guess 
x°  should  be  sufficiently  close  to  the  actual  root  x*.  This  Implies 
||  x°  -  x*  ||  <1.  For  many  initial-value  problem  this  condition  could  be 
satisfied.  However  the  main  disadvantage  is  that  at  each  k  level  of 
iteration  and  for  each  iterate  x  a  unique  Jacobian  must  be  computed  which 
is  not  quite  practical  and  for  large  systems  such  computations  are  very 
expensive. 

The  present  method  may  be  expressed  as  a  combination  of  nonlinear  Gauss- 
Seidel  iterations  and  Lieberstein's  method  as  follows: 


j  m  1,  2,  •••  n}  ic  *  0|  1,  2, 

Thus  if  r j  is  the  rate  of  convergence  of  Gauss-Seidel  iteration  and  r2 
is  the  rate  of  convergence  of  Liebersteln's  method ,  the  rate  of  convergence 
of  the  present  scheme  is  r*^.  Also,  if  at  each  level  Lipschitz  condition 
is  satisfied,  combining  both  steps  it  may  be  seen: 

ll*1*1  -**||  .  »*  11%“-**  II 

where  0  S  B  <  1.  Since  32  goes  to  zero  at  a  quadratic  speed,  the  method 
converges  even  if  ||  x°  -  x*||  >1.  For  various  nonlinear  systems  this  global 
convergence  property  has  been  verified  computationally  [14]. 

Furthermore,  whereas  Newton's  method  requires  (n2  +  n)  functionals  to 
be  computed  at  each  iteration  level,  only  3n  functionals  are  computed  at 
each  iteration  level  by  the  present  method.  Also,  to  compute  a  Jacobian 
(nxn),  n2  elements  must  be  stored  and  a  total  number  of  N  arithmetic 
operations  (addition,  subtraction,  multiplication)  must  be  done  where 
N  ■  n![2  +  1/2!  +  1/3!  +  ...  +  1/ (n— 1) ! ] .  In  comparison  with  this  the 
present  method  computes  and  stores  n-diagonal  elements  of  the  Jacobian. 

There  are  certain  interpolatory  methods  available  to  solve  stiff  ODE's 
[9].  These  algorithms,  developed  by  Certalne  and  Jain  reduce  differential 
equations  into  Integral  equations  which  were  integrated  by  approximating 
the  integrands  by  interpolation  polynomials  over  the  range  of  Integration. 
Although  both  of  these  methods  have  high  accuracy  and  are  A-stable,  for 
large  systems  they  are  not  practical. 
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The  present  method,  having  a  simple  algorithm,  is  applicable  to  most 
nonlinear  systems  in  general.  It  can  be  applied  to  nonlinear  integral  equa¬ 
tions  or  integro-differential  equations.  Some  applications  with  regard  to 
this  are  given  in  [12]. 

5*  Applications 

Magee  and  Chatterjee  [4]  developed  models  on  chemical  kinetics  and 
sought  solutions  of  these  problems  which  should  be  time-accurate.  These 
models  are  nonlinear  and  consist  of  a  sequence  of  stiff  ODE's.  Some  of  these 

_30 

chemical  species  grow  from  a  concentration  of  10  moles/liter  at  t  -  10-6 
sec  to  10~8  moles/liter  at  t  *  1.0  sec;  whereas  by  that  time  certain  species 
which  have  values  of  the  order  10"15  moles/liter  at  t  •  10“6  sec  grow  up  to 
10“5  molas/liter  at  t  ■  1.0  and  stay  almost  unchanged.  Since  the  equations 
are  stiff,  in  the  transient  process  numerical  effects  of  some  components  of 
the  solution  which  decay  much  faster  in  comparison  with  other  components  are 
quite  difficult  to  capture  in  a  computational  process.  This  lead s  to  some 
Inaccuracies  in  the  solution  which  eventually  causes  instabilities.  For 
example,  if  the  concentration  of  a  species  is  found  to  be  -10“ 3 5  mole/ liter 
but  whose  true  concentration  is  10"3°  moles/liter,  this  causes  severe  insta¬ 
bilities.  In  such  computations  which  are  very  "sensitive”  with  regard  to 
small  errors,  principles  of  perturbations  could  be  applied  in  order  to 
stabilize  the  algorithm.  Indeed  this  was  the  finding  when  two  distinct 
chemical  kinetics  problems  were  solved  by  the  method.  Let  us  consider  them. 

Models  representing  irradiation  of  water  with  y-rays  have  been  consid¬ 
ered.  Two  conditions  for  water  were  taken  (i)  Acid  Water  and  (ii)  Pure  Water. 
Model;  1  Irradiation  of  Acid  Water 

The  y-radiation  energy  is  absorbed  with  the  creation  of  molecular 
(H2,  H202)  and  radical  (H,  OH?  products  which  may  be  treated  as  if  formed 
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homogeneously  In  the  system.  The  mechanisms  related  to  formations  of  these 

species  are  described  in  [5,6].  Seven  species  created  by  the  radiation 

participate  in  thermal  reactions,  summarized  in  Table  1.  If  the  Irradiation 

* 

is  continuous  at  the  rate  of  100  I*s  electron  volts  per  liter  per  sec,  the 
differential  equations  which  describe  the  concentration  changes  are  given 
in  Table  2.  These  equations  are  stiff.  Here  creation  terms  give  concen¬ 
trations  of  created  species  in  terms  of  moles  per  liter  per  second. 

The  system  described  in  Table  2  may  be  expressed  as: 

where  u  -  [  (H)  (OH)  (H20)  (H2)  (H202)  (H02)  (02)  ]T  and  f  -  [fj  f2  ...  fy]T 
where  f i  corresponds  to  the  right  side  of  the  first  equation,  f2  corre¬ 
sponds  to  the  right  side  of  the  second  equation  etc.  Applying  a  backward 
Euler  difference  scheme  we  get 

u^1  -  u°  -  hf(un+*)  ■  0,  h  “  At 


This  nonlinear  system  for  u  may  now  be  expressed  as: 

F(U)  -  0 

where  U  ■  un+1  which  could  be  put  in  the  form  as 

U  -  G0(U) 

where  G0(U)  ■  U  +  a*F(U)  and  a  •  diag  (oj,  a2,  ...  07).  In  general, 

0  <  t  1.  This  is  done  primarily  because,  if  for  any  J  (j  ■  1,  2,  ...  7), 
l3GOj/3Ujl  becooes  00011  larger  than  1,  the  properties  of  D-Mappings  will  be 
violated  and  thus  the  method  is  very  likely  to  fail. 

A  numerical  solution  is  found  for  continuous  irradiation  at  constant 


rate  which  start  with  the  molecular  concentration  of  H2,  H202  and  02  equal 
to  zero.  For  this  case  (irradiated  and  water)  the  radical  concentrations 
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approach  "stationary  values"  very  quickly  and  as  known  from  the  chemical 
nature  of  the  system  that  H2  and  H2O2  are  destroyed  In  a  chain  reaction, 
these  concentrations  which  build  up  linearly  at  first,  approach  ^stationary 
values  on  a  longer  time  scale.  Computational  results  showed  these  behaviors 
of  various  concentrations  of  this  complicated  chemical  process.  Figures  1, 

2,  3  show  these  results  for  various  values  of  I.  (More  results  on  this 
project  will  be  published  from  Lawrence  Berkeley  Lab.)  From  "zero"  values, 

H  and  OH  quickly  approached  2  x  10“6  and  2.5  x  10“8,  respectively,  in 
0.2  sec  and  then  slowly  decreased  attaining  "stationary  values"  at  time  equal 
to  1  sec.  However,  H2,  H202,  H02  and  O2  attain  their  stationary  values  after 
almost  15  sec. 

Chatterjee  and  Magee  [4]  applied  an  implicit  second-order  Runge-Kutta 
method  to  solve  this  problem  and  recorded  its  failure. 

Model :  2  Irradiation  of  Pure  Water 

The  treatment  of  pure  water  is  somewhat  more  complicated  simply  because 
it  involves  more  species.  Here  H+  is  so  low  in  concentration  that  the 
hydrated  electrons  are  not  converted  into  H  atoms  before  the  trade  reactions 
occur  and  charged  species  must  be  treated  explicitly.  There  are  eleven 
equations  giving  the  rates  of  concentrations  of  eleven  species.  In  Table  3,  the 
thermal  reactions  and  in  Table  4  the  differential  equations  of  the  system  are 
given.  Continuous  irradiation  starting  with  zero  decompositions  for  all  con¬ 
centrations  of  species  excepting  H2O  ■  55  moles/liter,  H30+  «  1.0E  -  07 
moles/liter  and  OH  •  1.0E  -  07  moles/liter  was  done.  At  this  stage,  the 
ODE's  were  represented  as  integral  equations  which  were  approximated  by 
trapezoidal  rule.  The  method  virtually  failed  when  the  derivatives  were  approx¬ 
imated  by  backward  Euler's  difference  formula.  In  the  code  a  very  stringent 
convergence  criterion,  namely. 

It  —20 

WMAX  *  max  J  coV- 1  <  10 
j  J 


12 


was  used.  Since  Chis  should  give  a  very  high  order  of  accuracy,  it  was  felt 
that  such  a  criterion  is  necessary  in  order  to  get  an  almost  perfect  "mass 
balance"  and  "charge  balance"  both  of  which  should  be  (theoretically)  equal 

4. 

X 

to  "zero"  at  all  times.  In  submicroseconds,  the  radical  species  like  H,  OH, 
e^  etc.  grow  much  more  rapidly  than  the  molecular  species  02,  O2,  H2,  H2O2. 
These  have  strong  impact  upon  the  mathematical  model  for  the  subsequent 
chemical  yields.  Thus  At  •  10~®  was  chosen  initially.  It  was  noticed  that 
while  all  other  species  were  growing,  O2  and  O2  stayed  zero  up  to  5  »  10-8  sec. 
Then  O2  grew  faster  .d  O2  >  O2  at  all  time  levels.  The  special  properties  of 
these  species  which  possibly  have  some  strong  impact  upon  the  solution  as 
time  increases,  cannot  be  detected  if  a  large  time  step  is  used.  The  figure  4 
shows  the  behavior  of  various  species. 

6.  Discussions 

Vxi  V 

From  equation  (12)  it  is  clear  that  if  ’  •  1,  the  method  fails. 

It  has  been  proved  and  demonstrated  in  (14),  that  the  perturbation  parameters 

k 

Wj  stabilize  the  algorithm  and  speed  up  the  rate  of  convergence  provided 
the  Jacobian  of  the  matrix  representation  of  the  system  has  nonzero  diagonal 
elements  (see  the  Appendix).  When  diagonal  terms  are  null,  artificially  they 
are  brought  in  and  are  damped  out  as  convergence  is  approached.  This  may  be 
called  nonlinear  scaling.  However,  if  diagonal  elements  are  zeros  and  nonlinear 
scaling  is  not  performed  the  rate  of  convergence  is  significantly  decreased 
and  the  algorithm  could  even  be  destabilized.  This  was  found  with  regard  to 
solution  of  several  problems.  The  essential  strength  of  the  method  lies  in 
having  functionals  with  diagonal  nonlinearity.  Fortunately,  the  set  of  equa¬ 
tions  given  in  Table  2  and  Table  4  fulfill  these  requirements. 

By  computer  experimentation  it  has  been  found  that  for  nonlinear  systems 
having  multiple  roots,  the  present  scheme  is  not  quite  effective. 


Although  on  a  trial  basis  this  method  can  be  applied  to  solve  a  nonlinear 
system,  it  may  be  useful  to  apply  the  properties  of  D-Mappings  to  the  func¬ 
tionals  Gj  in  order  that  the  method  may  be  applied  successfully  [8,15],  Such 
an  analysis  is  generally  quite  complicated. 

7.  Concluding  Remarks 

This  is  a  preliminary  report.  The  objective  with  regard  to  the  solu¬ 
tion  of  the  chemical  kinetic  problem  was  primarily  to  study  the  transient 

just 

growths  of  the  species,  not*the  final  equilibrium  values  which  were  somewhat 
known  to  the  chemists  [4]  through  experimental  data.  With  this  regard,  the 
present  method  served  the  purpose  of  the  researchers  very  well. 
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Table  1:  Reactions  in  Irradiated  Water  (Acid) 


Reactions 

Reaction  rate 
constant, 

1/ (mol  s) 

A.  Recombination  of 

Primary  Radicals 

1. 

H  +  H  -*■  H2 

lxlO10 

2. 

H  +  OH  -*■  H20 

2.4  x 1010 

3. 

OH  +  OH  -*•  H202 

4  xio9 

B.  Reactions  of  Radicals 

and  Product  Molecules 

4. 

H  +  H202  ♦  H20  +  OH 

lx  108 

5. 

OH  +  H202  H20  +  H02 

5x  107 

6. 

OH  +  H2  •*  H20  +  H 

6x  107 

7. 

H02  +  H  -*■  H202 

lxlO10 

8. 

H02  +  OH  -  h20  +  02 

lx  1010 

9. 

H02  +  ho2  -*  H202  +  02 

2  x  106 

10. 

H  +  02  -  H02 

lxlO10 

Table  3:  Reactions  in  Irradiated  Neutral  Water 


Reaction  rate 
constant 

Reactions  i/(mol  s)  *• 


A.  Recombination  of  Primary  Radicals 

1. 

H  +  H  -*■  H2 

1  *  IO10 

2. 

e“Aq  +  H  H2  +  0H~ 

2.5  x  1010 

3. 

e“  +  e“  -*•  H,  +  20H" 
Aq  Aq  z 

6  x  10® 

4. 

e"  +  OH  ■+•  OH” 

Aq 

3  x  1010 

5. 

H  +  OH  H20 

2.4  x  1010 

6. 

OH  +  OH  -*•  H202 

4  x  10® 

7. 

H3°+  +  .-M  *  H 

2.3  x  io10 

8. 

H3<!>  +  OH”  -»■  H20 

3  x  10>° 

B.  Reactions  of  Radicals 

with  Product  Molecules 

9. 

H  +  H202  H20  +  OH 

1  x  io8 

10. 

e“Aq  +  H202  -*■  OH  +  OH" 

1.2  x  io10 

11. 

OH  +  H202  -*•  H20  +  H02 

5  x  107 

12. 

OH  +  H2  -*•  H20  +  H 

6  x  107 

13. 

H02  +  H  ♦  H202 

1  x  IO10 

14. 

e~Aq  +  °2  ■"  °2“ 

1.9  x  io10 

1 
I 
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Table  3:  Reactions  in  Irradiated  Neutral  Water 

(cont.) 


Reaction  rate* 
constant 

Reactions  £/(mol  s) 


B.  Reactions  of  Radicals  with  Product  Molecules 

(cont.) 


15.  H02  +  OH  -*■  H20  +  02 

1 

x  1010 

16.  H02  +  H02  +  H202  +  02 

2 

x  106 

17.  H  +  02  -*>  H02 

1 

X  1010 

18.  02“  +  H^O  -*•  H02  +  H20 

3 

x  IQ10 

C.  Dissociation  Reactions 

+ 

19.  H20  ->  H30  +  OH 

20.  H02  -*■  H30  +  02“ 

*Rate  constant,  sec"1 


5.5  x  10"6* 
1  x  106* 


4:  Differential  Equations  for  Transient  Species  and 
Radiation  Products  in  Irradiated  Neutral  Water 


:6(OH)2  -  k9(H)(H202)  -  k10(e"  )(H202)  -  ku(OH)(H202)  +  ki3(H02)(H)  +  ki6(H02) 


Table  4:  Differential  Equations  for  Transient  Species  and 
Radiation  Products  in  Irradiated  Neutral  Water 

(cont.) 


22 


Appendix 


The  transient  solutions  for  the  chemical  kinetic  problems  were  justified 
by  virtue  of  chemical  analysis.  When  solutions  went  into  the  stiff  regions , 
equilibrium  values  of  the  concentrations  of  the  species  were  found  and  those 
were  somewhat  known  to  the  chemists  [4]. 

At  this  stage,  a  question  may  still  arise  regarding  the  effectiveness 
of  this  method  (which  could  be  easily  verified  mathematically)  with  regard 
to  the  following  inquiries:  Given  a  stiff  system  (i)  how  good  is  this  method 

t 

to  generate  transient  solutions  which  could  be.  verified  mathematically  and 
(ii)  how  fast  equilibrium  values  may  be  found. 

We  also  intend  to  write  down  the  algorithm  of  the  method  with  regard 
to  the  solution  of  a  system. 

Consider: 

dx 


-  -10004  x  +  10000  y' 
■  -y  +  x  -  y4 


x(0)  -  y(0)  -  1 


(Al) 


This  is  a  stiff  system  given  in  [3].  It  has  an  approximate  analytical  solu¬ 
tion  given  by 


10004 

y  “  • 


10008 


e~3t  ~ 

.  -3t 
-  4e  J 


1/3 


(A2) 


10000  4 
x  "  10004  y 


Using  backward  Euler  approximation  the  difference  approximations  may  be 
expressed  as: 


1 

4 
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Xj  ■  At  (-10004  Xj  +  10000  y^)  +  x^  ^ 
yj  ’  +  “j  -  yj4)  +  yj- 


x  »  y  ■  1 
o  Jo 


Now  let  us  see  how  to  apply  the  present  algorithm  to  solve  the  differ¬ 
ence  equations  (A3)  and  (A4) ,  We  note  that  when  we  are  solving  for  x^  and 
jy  Xj_t  and  y  are  known. 

To  simplify  our  notations  let  us  replace  x.  by  X  and  y.  by  Y,  x.  by 

3  3  3  1 

XO  and  y^_j  by  Y0.  Then  the  system  may  be  expressed  as: 

X  -  F(X,Y) 

Y  -  G(X,Y) 

where  F(X,Y)  -  At(-10004  X  +  10000  Y4)  +  X0 
G(X,Y)  -  At(-Y  +  X  -  Y4)  +  YO 
Thus  -  -10004  At;  Gy  *  -4  At  Y3. 

Step  #1  Make  guesses  for  X  and  Y.  Usually  the  guesses  are 

XO,  YO. 

If 

At  some  (k  4-  1)  Iteration  and  t  time-level  (X  ■ 

n 

value  of  X  at  the  kth  Iteration) 


Step  #2  Compute: 

FI  - 

If  If 

F(X\  YK) 

Step  #3 

F2  - 

F(F1,  Yk) 

Step  #4 

DFDX 

-  FX(F1,  Yk) 

Step  OS 

WX  - 

(F2  -  Fl)/(1  -  DFDX) 

Step  #6 

xk+1 

-  WX  +  FI 

Step  #7 

G1  - 

k+l  k 

G(X,  YK) 

Step  0% 

G2  - 

G(Xk+1,  Gl) 

Step  #9 

DGDY 

-  GY(Xk+1,  Gl) 

Step  #10 

WY  - 

(G2  -  Gl)/ (1  -  DGDY) 
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Vil 

Step  #11  Y  -  WY  +  G1 

Step  #12  UMAX  -  max  (|WX| ,  |WY|) 

Step  #13  I£  UMAX  <  e  ,  convergence  is  found  and  we  go  to  Step 

#15,  otherwise  we  use  Step  #14 

Step  #14  Increase  the  level  of  Iteration  by  1  and  start  from 

Step  #2. 

However  if  KMAX  is  the  largest  number  of  Iterations  that  could  be  used  and 
if  k  >  KMAX  then  the  failure  of  the  method  is  recorded. 

Step  #15  Print  the  values  of  X  and  Y  (and  possibly  the  time- 

level  and  the  value  of  k) 

Step  #16  Now  time-level  is  incremented  by  1. 

Update  XO  and  YO  by  setting  XO  ■  X  and  YO  ■  Y 

(These  are  the  guesses  for  X  and  Y  at  this  new 

time-level  solution.) 

Then  again  at  some  (k  +  l)th  level  of  Iteration  of  this  time-step  computa¬ 
tions  start  from  Step  #2. 

-10 

In  general  we  choose  e  ■  10  . 

Case:l  Transient  Solutions 

In  the  Table:  Al,  the  transient-part  of  the  solutions  are  given.  The 
results  are  quite  close  to  those  obtained  analytically.  Here  we  have  used 
a  fixed  At  *  0.005. 

Case: 2  Steady-State  Solutions 

If  the  objective  is  to  compute  solutions  in  the  stiff  region,  we  may 
use  variable  time  steps  and  At  may  be  very  large.  Using  a  sequence  of 
At  -  1,  10,  102,  103,  ...  10®,  it  has  been  found  that  the  steady  state 
was  found  (with  no  oscillations)  after  a  total  of  55  Iterations  for  all 
time  steps  using  Radio  Shack  TRS-80  color  microcomputer  (16K)  and  the 
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total  time  for  computation  was  little  less  than  a  minute.  As  expected 
went  to  zero  much  faster  than  y^'s. 

|  At  this  stage  both  Gauss-Seidel  iterations  and  Newton's  method  were 


used.  Both  failed  for  each  case. 


Exact  Solution 


Dey 


t 

X 

y 

X 

y 

0 

1.0 

1.0 

1.0 

1.0 

m 

0.25 

0.368021198 

0.778953673 

0.368799110 

0.779286476 

~ . ' 

0.50 

0.135369435 

0.606629568 

0.136012224 

0.607287244 

0.75 

0.049796505 

0.472436166 

0.050160955 

0.473250544 

1.00 

0.0183185671 

0.367930928 

0.018499225 

0.368797576 

1.25 

0.73174825E-03 

0.286467519 

6,82246374E-03 

0.287398821 

1.50 

2. 479097 32E-03 

0.223160237 

2 . 516106E-03 

0.223965900 

ft 

1.75 

9.12006087E-04 

0.173797232 

9.27933E-04 

0.174533507 

2.0 

3.35507793E-04 

0.13535337 

3.42219E-04 

0.136011530 

2.25 

1 . 23426334E-04 

0.105413292 

1.26209E-04 

0.105991891 

E 

2.5 

4.54059955E-05 

0.0820959478 

4.6546E-05 

0.082598004 

6.667  x  10~”  in  the  logarithmic  scale. 


CONCENTRATION  (moles/l) 


B 


« 


i 


3.  Figure  3.  Concentrations  of  Species  (Model :1)  vs.  time  for 
I  ■  6.667  x  10”B  in  the  logarithmic  scale. 
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Nonlinear  Sensitivity  Analysis  in  Chemical  Kinetics  —  The 
Scaled  Green's  Function  Method 

Jenn-Tai  Hwang 
Department  of  Chemistry 
National  Tsing  Hua  University 
Hsinchu,  Taiwan  300 
Republic  of  China 

Abstract:  A  new  approach  to  the  nonlinear  sensitivity  problems 

in  chemical  kinetics  is  presented.  The  method  is  based  on  a 
Green’s  function  method  developed  earlier  by  Hwang,  et  al . 

In  order  to  calculate  higher-order  sensitivity  coefficients,  new 
ways  of  evaluating  the  Green's  function  matrix  and  the  sensitivity 
integrals  are  developed.  The  Green's  function  matrix  is  rescaled 
to  the  unity  matrix  at  the  beginning  of  every  step  in  the  numerical 
integration  of  the  rate  equations.  There  is  no  error  control  on 
the  Green's  function  matrix.  However,  due  to  the  rescaling 
procedure,  this  does  not  entail  any  stability  problem.  The  start-up 
difficulty  is  thus  completely  eliminated.  To  solve  for  the  Green's 
function  matrix  in  each  interval,  a  special  exponential  method  is 
employed.  No  matrix  inversions  are  needed  to  compute  the  sensi¬ 
tivity  coefficients. 

This  method  is  compared  and  contrasted  with  the  direct 
variation  method,  and  the  comparisons  indicated  it  to  be  a  very 
promising  approach  to  the  nonlinear  sensitivity  problem. 


1 .  Introduction 


The  need  for  systematic  sensitivity  analysis  of  large 
computational  models  has  become  increasingly  apparent  in  recent 
years. ^  ^  This  need  is  emerging  as  complex  numerical  models 
are  more  and  more  frequently  employed  for  problem  solving  in 
numerous,  areas  that  include  atmospheric  science,  combustion, 
chemical  laser  studies,  dynamical  engineering  systems  and  so  on. 

The  objective  of  a  sensitivity  analysis  is  to  study  the  response 
of  a  dynamical  system  to  parameter  variations.  When  the  com¬ 
plexity  of  the  models  makes  it  difficult  to  determine  the 
effect  errors  in  the  physical  and  chemical  parameters  have  on 
the  solution  vectors,  undertaking  of  a  sensitivity  analysis  is 
particularly  desirable. 

Conceptually  the  simplest  approach  to  a  sensitivity  analysis 
is  to  solve  the  system  equations  repeatedly  while  varying  one 
(or  several)  parameter (s)  at  a  time.  However,  this  soon  becomes 
impractical  as  the  number  of  parameters  subject  to  variation 
increases.  Three  methods  aiming  at  the  reduction  of  computa¬ 
tional  efforts  thus  appeared:  they  are  the  Fourier  amplitude 

5  7 

sensitivity  test  (FAST)  ,  the  direct  method  (DM)  ,  and  the  Green's 
function  method  (GFM)la.  The  applicability  regions  and  disadvan- 

1 C  j.0 

tages  of  the  above  three  methods  have  recently  been  discussed.  '  ' 

Here  we  merely  comment  that  these  three  methods  are  either  com¬ 
putationally  still  too  costly  or  are  limited  to  linear  sensitivity 


1 


studies  only.  By  a  linear  sensitivity  study  we  mean  a  study 
based  on  the  first-order  Taylor  series  expansion  of  the  solution 
vector.  As  a  linear  analysis  is  inadequate  for  a  system  with 
large  parameter  variations  and  computational  models  involving 
parameters  of  large  uncertainties  are  quite  common  —  especially 
models  in  chemical  kinetics  —  it  is  clearly  desirable  to  have 
a  practical  method  whereby  nonlinear  model  behavior  in  parametric 
space  can  be  assessed. 

In  this  work  a  new  nonlinear  sensitivity  method  based  on 
rescaling  of  the  Green's  function  matrix  in  GFMla. is  developed. 

To  prevent  excessive  computation  associated  with  the  frequent 
rescalings,  the  Green's  function  matrix  equation  is  solved  in 
a  rather  novel  fashion;  its  error  is  not  controlled.  This 
seemingly  outrageous  behavior  has  its  justification  in  the  very 
act  of  rescaling.  As  explained  in  Sec.  2,  rescaling  essentially 
stops  the  propagation  of  errors.  Therefore,  it  is  only  the  local 
truncation  error  that  determines  the  accuracy  of  the  Green's 
function  matrix;  the  usual  stability  problem  in  the  numerical 
integration  of  ordinary  differential  equations  never  appears. 

The  basic  theory  of  the  scaled  Green's  function  method 
(SGFM)  is  presented  in  the  next  section.  The  last  section 
concludes  with  discussions  on  possible  future  improvements  and 
applications  to  several  kinetics  systems. 


2 .  The  scaled  Green's  Function  Method  (SGFM)  of  Sensitivity 


Analysis 


Consider  a  general  nonlinear  first-order  N-vector  differential 
equation 


X'  =  l(Z>  t)  , 
with  the  initial  condition 


(1) 


y  (0)  =  b. 


(2) 


In  Eq.  (1)  the  prime  indicates  differentiation  with  respect  to 
t;  the  M-vector  a  denotes  collectively  a  set  of  M  parameters 
{a^}^_^;  f  is  an  N-vector  function  of  y,  a  and  t.  Equation  (1) 
can  be  viewed,  for  example,  as  N  coupled  chemical  rate  equations 
with  y(a,  t)  representing  the  N  species  concentrations  at  time 
t  and  a  the  rate  coefficients  associated  with  M  elementary 
chemical  reactions.  It  should  be  noted  here  that  due  to  the 
(possible)  vast  difference  in  time  constants  of  various  chemical 
species  in  (complex)  reaction  schemes,  Eq.  (1)  is  generally  stiff. 


In  order  to  study  the  response  of  y  to  the  variation  of 
a,  partials  of  y  with  respect  to  a  are  evaluated.  An  nth-order 
partial 


=  3ny/9a£  3a&  •••3aJ 


“a£  a4  *  *  *a2  ’  ~  ll  l2  '  ln 

12  n 


(3) 


is  called  an  nth-order  sensitivity  coefficient  of  y  with 

respect  to  a.  ,  a.  ,  *‘*a0  .  The  time  propagation  of  a  first- 
1  2 

order  (linear)  sensitivity  coefficient  y  is  governed  by  the 

l 

following  first-order  sensitivity  equation^-a'2 : 


(4) 


where 


y '  -  J  y,  =  f 

a£  ~  a£  £- 


f  =  3f/3a#, 
_a£  1 

Jii  =  3V3yj 


The  initial  condition  to  Eq.  (4)  is^a'^ 


v  (0)  =  0. 


A  second-order  sensitivity  coefficient  v  is  governed  by  the 

a£am 

following  second-order  sensitivity  equation^3: 


Y*  -  J  v  ,  =  f  „  +  J  y,  +  J_  Y_  +  Lv  v  *  (8) 

a£am  "  a£  m  a£am  a£  m  “am  £  =  £  m 


where 


f  a  =  3af /3a. 3a  , 
a  .  a  £  in 

x.  m 

J  *  3J/3a., 

Lijk  *  3Jij/3yk' 


U  x) i  -  J  Lijkwjxk* 

“  J  «L 


Notice  that  Eq.  (4)  and  (8)  share  the  same  homogeneous  part 
a  characteristic  among  all  sensitivity  equations. 


To  solve  Eqs.  (4)  and  (8),  we  compute  first  a  Green's 
function  matrix  G  which  satisfies 


G'  =  J  g, 
G  ( 0 )  =  1, 


4 


J 


where  1  is  the  unity  matrix  .  Then 


V  <t)  =  G  (t) /q  dxG"1(x)f  (x); 


a  similar  equation  holds  for  ^a  a  (t) .  (In  what  follows,  only 

2.  m 

v  (t)  will  be  discussed;  the  same  argument  applies  to  v  (t) 
a£  aSlam 

and  other  higher-order  sensitivity  coefficients.)  The  set  of 

M(M+3)/2  N-vector  differential  equations  [Eqs.  (4)  and  (8)]  has 

thus  been  replaced  by  a  set  of  N  N-vector  differential .equations 

(Eqs.  (13) - (14) ]  and  M(M+3)/2  N-vector  integrals  [Eq.  (15)  and  a 

similar  equation  for  y  ] .  This  is  advantageous  when  M  is 

a£am 

large  because  numerical  quadrature  of  reasonably  smooth  integrands 

is  much  easier  than  numerical  integration  of  stiff  ordinary 

differential  equations  —  recall  that  in  chemical  kinetics  M  is 

2 

of  the  order  of  N  . 


Evaluation  of  £  through  Eq.  (15)  requires  G  .  Although 
a£ 

in  principle  g  is  never  singular,  numerically  it  could  be  so  for 
stiff  systems13.  One  way  out  of  this  difficulty  is  to  rescale  G 
to  1  at  a  sufficiently  fine  grid  of  points.  Let  us  assume  that 
G  is  rescaled  at  t^.tj, • • • ,tg.  If  Gr  is  the  scaled  solution  to 
Eq.  (13)  in  [t  .  ^+^1  which  satisfies 

Gr(tr)  =  1,  (16) 

1 6 

the  relations  between  G  and  the  Gr  are  easily  seen  to  be 


£(t)  «  GQ(t> 


te [ 0 , t1 ] 


is(t)Ss-l(ts) '  t£[ts,tl. 


0  '  1 
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Using  Eq.  (17)  ,  y  then  becomes 

^  A 


le 


Za  (t)  =  fir(t)[/J  dx  G-1  (x)  f  (x)  +  v  (t  )], 
al  ~r  fcr  "  “a9  a9.  r 


(18) 


l  *  wr  -  “s. 

where  te[tr,  tr+il .  (An  equation  similar  in  form  to  Eq.  (18) 

holds  for  v  .)  Equation  (18)  demonstrates  that  sensitivity 
£  m 

coefficients  can  be  evaluated  in  a  straightforward  and  recursive 
manner  without  ever  worrying  about  the  existence  of  scaling  factors, 
GQ(t1),  G1(t2),  •••  The  integrand  on  the  RHS  of  Eq.  (18), 


2(t,  x)  =  G  (tJG'^xJf  (x)  , 
-r  -r  -aA 


(19) 


If 


has  a  very  interesting  and  useful  property  :  at  the  two  end  points 


t  and  t. 


a(t,t)  =  fa  (t) , 
a(t,tr)  =  Gr(t)f  (tr) 


(20) 

(21) 


Equation  (21)  follows  from  the  initial  condition  of  the  scaled 
Green's  function  matrix  Gr,  Eq.  (16).  Notice  that  there  is  no 
involvement  of  G~^  in  Eqs.  (20) -(21). 


If  the  integrand  3  varies  sufficiently  slowly  in  tt  ,t],  or 

if  tr  and  t  are  sufficiently  close  together,  the  integral  can  be 

calculated  by  means  of  two  end-points  only.  Thus,  using  the 

trapezoidal  rule  gives 

(t-t  ) 

y.  (t)  -  -2  r  [f  (t)  +  G  (t)  f  (tr)J  + 

al  *  ~ai  ~r  a£  r 

G(t)  v  (t).  (22) 

-r  a.t  r 
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Equation  (22)  and  other  similar  equations  for  the  higher-order 
sensitivity  coefficients  are  the  main  working  equations  of  the 
SGFM.  Notice  that  using  other  two-point  quadrature  formulae  can 
change  the  form  of  Eq.  (24)  ,  but  the  nonnecessity  of  performing 
matrix-  inversion  remains.  The  validity  of  Eq.  (22)  rests  on 
the  slowness  in  variation  of  the  integrand,  or  the  closeness 
of  the  two  integration  end-points.  Clearly,  these  two  conditions 
are  satisfied  if  we  rescale  G  at  every  step.  That  is,  during 
the  normal  numerical  integration  of  Eqs.  (1)  and  (13) ,  we  let 
G  equal  1  at  the  beginning  of  every  step.  This  procedure, 
however,  would  lead  to  disaster  for  the  following  reason. 

To  solve  a  stiff  system  a  practical  way  is  to  use  a  variable- 

11-13  12-13 

step  variable-order  algorithm  ,  such  as  that  of  Gear 

Now,  in  Gear's  method  the  differential  equation  solver  takes  a 

large  step  only  when  the  rapidly  changing  component  has  decayed 

12 

to  an  insignificant  level  ,  a  situation  which  could  never  happen 
if  the  condition  of  a  stiff  system  is  repeatedly  altered  at  the 
start  of  every  new  step.  In  consequence,  the  step-size  remains 
unbearably  small  throughout  the  whole  integration  process. 

This  sort  of  severe  limitation  upon  step-size  is  the  main 
factor  which  renders  the  step-by-step  rescaling  a  prohibitively 
costly  procedure.  We  now  propose  to  raise  this  limitation  by 
solving  Eq.  (13)  simultaneously  with  Eq.  (1)  without  any  error 
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control  on  6.  In  this  way,  the  Green's  function  matrix  equation 
never  affects  the  choice  of  step-size  and  the  integration 
process  can  proceed  smoothly  even  in  the  presence  of  rescalings. 
Justification  for  this  outrageous  behavior  comes  from  the  very 
act  of  rescaling.  As  one  can  see  clearly,  setting  the  Green's 
function  matrix  g  to  a  definite  constant  matrix  i  involves 
absolutely  no  error  at  all.  Therefore,  the  possible  amplification 
of  errors  as  the  ODE  solver  progresses  from  one  step  to  another 
is  eliminated.  The  accuracy  of  g  in  each  step  depends  entirely 
upon  the  way  it  is  computed  there;  the  usual  stability  problem 
simply  does  not  appear  in  SGFM. 


To  calculate  g,  we  first  note  that  the  solution  to  Eq.  (13) 
always  changes  (exponentially)  faster  than  the  coefficient  J. 
(For  example,  the  solution  to  G'=tnG  is  proportional  to 
exp[tn+1/ (n+1) ] .)  Therefore,  the  step-size  h  chosen  by  Eq.  (1) 
is  definitely  unsuitable  for  g  when  h  is  large,  if  the  same 
method  as  that  used  to  calculate  ^  is  used  to  calculate  g. 

Let  us  proceed  by  writing  Eq.  (13)  as 


J. .G. . 
n  13 


+  k(Si)JikGkj' 


i, j=l,2, • • • ,N. 


(23) 


If  J  is  nearly  a  constant  in  (tQ,t]  (see  below),  then 


Gij  <t)=exp[Ju(t-t0)  H6ij+k(§i)  Jik/tQdxGkj  (x)exp[-Ju(x-t0)  ] } ,  (24) 
where  the  initial  condition 


§<V  =  i 


(25) 
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has  been  used.  To  obtain  an  explicit  expression  for  (t) , 
we  let,  as  guided  by  the  form  of  Eq.  (24) , 


Gkj,x|  ■  exPU^Ix-to'1- 

The  functions  (x)  are  evaluated  as  follows. 

*  <x-to>n  (n\ 

gkj(x)  =  exp(-Jkk(x-t0)]n|o  — -  G]Jj)(t0) 


^  ^”Jkk^  m  (1)  ^"*1  m+1 

=  G.  .(tQ)  I  —7^  --(x-t  )m  +  (t0)  I  - (x-tn)m+1 

kj  o  m-g  m!  0  kj  0  m®0  0 


G(L)  it  1 

+  -\r  --(x-t0)L  +  o((x-t0)L+1) , 


where 


Gkj’  <to)  -  < V  •  ,28) 

It  can  be  shown  readily  that  Eq.  (27)  gives  correct  time 
derivatives  of  Gk^  at  tQ  up  to  the  Lth-order .  Substituting 
Eq.  (27)  into  Eq.  (24)  then  yields 


Gij(t)  *  exp[J. .  <t-t0>]{6..  + 

(1-6.. )J..  -  A  (J. .-J . . ;  t-t_)  + 

13  ijm=0  in'  ®  j]  ii  0 


(1)  .K1  ( ~Jkk) m 


1  JiklGki  (V  E 

k  (*i)  lK  kJ  0  m=0 


St-  Am+l(Jkk-Jii'*  t't0)  + 


r{L)  it  i 
Gkj  fc0 


AL(Jkk"Jii;  t"t0)1  +  0((t-t0)L+iJi  },  (29) 


where 


Am(u;v)  «  /q  dx  xm  exp(ux). 
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The  A_  can  be  evaluated  in  a  recursive  manner: 
m 

Am(u;v)  =  u  1[vmexp(uv)  -  (u;v)  ] ,  m=l,2,---, 

Ag  (u; v)  «  u-1(exp(uv)  -1]. 


(31) 

v 

(32) 


We  would  like  to  emphasize  here  that  Eq.  (29)  does  not 
necessarily  imply  the  behavior  of  is  governed  by 
exp [ (t-tg) ] .  This  can  be  appreciated  more  easily  through 
examining  the  factors  expfJ^ (t-tQ)  ]Am(J..;.-Jii;t-tg) : 


"^m-l (Jjj,Jii; 

Bo(Jjj'Ju!t-to»  *  <Jjj-Jii)'ltaxP(J:j(t-t0)i  - 


exp[Jii(c  fc.g)  ] } . 


(33) 


(34) 


Clearly,  G^  (t)  as  given  by  Eq.  (29)  consists  of  exp[J\  j  (t-tQ)  ] 
for  all  j  and  also  powers  of  (t-tg)m  for  m=0,l,2, • • • ,L. 


The  free  parameter  L  in  the  theory  is  related  to  the  order 

of  accuracy  of  Eq.  (27)  and  can  be  chosen  in  a  variety  of  ways. 

For  example,  L  can  be  equated  simply  to  the  current  order  q  of 

12-13 

the  backward  differentiation  formula  used  in  solving 
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Eq.  (1).  The  GjJj^  (tQ)  •  n=l ,2 ,  •  •  •  •  ,L,  are  then  calculated 
through  direct  differentiation^.  Alternatively,  one  may  fix 
L  at  1  while  dividing  each  step  into  q  intervals.  Within  each 
interval,  Eq.  (29)  is  applied  to  solve  for  the  "component" 

Green's  function  matrix.  The  final  answer  is  then  obtained  by 
multiplying  together  the  resulting  q  "component"  Green's 
function  matrices.  For  example,  let  us  say  the  step  ttg,^] 
is  divided  into  two  parts:  [tg,t'J,  [t',t^].  Equation  (29) 
can  be  applied  to  each  interval  to  solve  for  S(t')  and  ^(t,), 
where  G  and  G^  satisfy  G (tg) -G^ (t ' ) =1 .  The  desired  g ( t^ )  is 
then  given  by  [cf.,  Eq.  (17)] 

G(tx)  =  G1(t1)  G ( t ' ) ,  (35) 

The  test  problems  next  section  were  solved  by  the  latter 
procedure.  There  are  two  reasons  for  this  choice.  Firstly, 
fixing  L  at  1  avoids  the  time-consuming  computation  of  higher- 
order  derivatives  of  G.  Secondly,  dividing  each  step  into  q 
intervals  reduces  the  time  variation  of  J  (recall  that  Eq.  (29) 
was  derived  under  the  assumption  that  J  is  nearly  a  constant) . 

The  theory  developed  in  this  section  is  not  exact,  especially 
so  when  no  error  control  is  performed  on  G.  Therefore,  some 
indications  as  to  the  reliability  of  computed  sensitivity 
coefficients  are  required.  One  useful  measure  for  this  purpose 
is  det  g,  the  determinant  of  Q:  As  can  be  shown^a,  the  exact 
det  §(t)  satisfies 
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1 


det  G (t)  *  expt/Jjdx  E  J..(x)3,  (36) 

0  i=1  ii 

which  allows  det  G  to  be  calculated  almost  effortlessly  as 
Egs.  (1)  and  (13)  are  being  solved.  Other  possible  candidates 
are  the  various  conservation  relations  among  elements  of  the  g 
matrix  (see  Appendix) .  When  serious  deviations  from  these 
measures  occur,  inferences  concerning  the  system  sensitivity 
behavior  are  at  best  doubtful.  In  this  case,  a  hybrid  method 
which  switches  to  DM^  may  have  to  be  used.  Notice,  however, 
that  there  are  many  interesting  kinetic  problems  of  which 
sensitivity  information  is  required  only  for  a  relatively  short 
time  span.  Take  the  whole  class  of  combustion  problems  for 
example.  Here,  an  analysis  of  the  reaction  mechanism  near 
ignition  is  highly  desirable,  both  for  academic  and  for  practical 

-5 

reasons.  As  ignition  time  is  typically  10  sec,  SGFM  of 
sensitivity  analysis  may  usually  be  applied  with  confidence. 

3 .  Discussions 

The  SGFM  of  sensitivity  analysis  was  applied  to  (1)  a  simple 

^1  3 

first-order  reaction  system  (C^- - -  C2;  C1(0)=10  ,  C2(0)=1, 

3  *2 

k1*10  ,  k2=l) ,  (2)  Chapman  mechanism  for  atmospheric  ozone 

kinetics  (2  species,  4  steps) (3)  nitrous  oxide  decomposition 

(7  species,  9  steps;  T=2512°K,  P(atm)=0.37  (73.7%  Ar,  2.0%  N20, 

14 

21.3%  N2,  3%  02) )  ,  and  (4)  Bowman  mechanism  for  methane 

4 

oxidation  (13  species,  23  steps)  .  The  time  spans  of  sensitivity 

-4 

computation  are  1  sec  (system  1) ,  20  sec  (system  2) ,  10  sec 
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(system  3) ,  and  2.5x10  ^  sec  (system  4) .  All  calculations  were 
done  in  single  precision  on  a  CDC-CYBER-172  computer.  Tolerance 
varies  from  10  (systems  1  and  2)  to  10  (systems  3  and  4) . 

For  all  systems  studied,  the  first-  and  second-order  sensitivity 
coefficients  computed  by  SGFM  and  DM  generally  agreed  with  each 
other.  However,  SGFM  enjoyed  a  considerably  higher  computation 
speed.  For  example,  SGFM  is  -10  times  faster  than  DM  for  system 
2,  and  over  400  times  faster  for  system  3.  More  details  will  be 
presented  at  the  conference. 


There  are  several  possible  areas  where  improvements  can 

likely  be  made.  The  first  area  concerns  the  method  of  numerical 

integration:  Trapezoidal  rule  is  evidently  not  the  best,  and 

1  ^ 

a  two-point  exponential  rule  (or  other  more  sophisticated  ones) 
could  do  better.  Another  area  is  the  manner  in  which  sensitivity 
coefficients  are  computed.  Presently,  the  sensitivity  coefficients 
are  computed  once  for  each  step,  while  Green's  function  matrix 
is  computed  q  times  for  each  step.  It  is  clear  that  higher 
accuracy  may  be  achieved  for  the  first-order  sensitivity 
coefficients  if  they  are  also  computed  q  times.  We  do  not  expect 
this  procedure  to  require  too  much  extra  computational  time. 
However,  the  resultant  improvements  to  second-order  sensitivity 


coefficients  should  be  significant.  Finally,  the  way  g^j (x) 
are  computed  (see  Eq.  (27)]  may  be  improved.  For  example,  one 


may  try  to  eliminate  the  necessity  of  GjV  *  (x)  by  techniques 


similar  to  the  multi-step  method  in  numerical  solution  of 
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ordinary  differential  equations. 


Appendix 

We  examine  in  this  appendix  some  general  properties  of  the 
Green's  function  matrix  G  defined  by  Eqs.  (13) -(14).  The 
discussions  are  pertinent  to  chemical  kinetics  only. 


First  we  prove  that  the  determinant  of  §  is  in  general  a 
monotonic  decreasing  function  of  time.  In  chemical  kinetics, 
tho  rate  of  change  of  species  i  can  usually  be  written  as 

Ci  =  "DiCi  +  Pi  (A1) 


In  Eq.  (Al) ,  the  prime  indicates  differentiation  with  respect  to 
time;  Ci  is  the  concentration  of  species  i;  -D^  is  the  con¬ 
tribution  to  resulting  from  destruction  of  species  i;  P.^  is 
the  production  part  which  is  independent  of  Ci-  Clearly, 


3C|/9Ci 


=  -Di  -  ODi/3Ci)Ci 

<0.  (A2) 

Therefore, 

det  g(t)  =  expl/gdx  Z  J^fx)] 

(See  Ref.  la)  is  a  monotonic  decreasing  function  of  t. 
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We  then  show  that  columns  of  the  §  matrix  satisfy  certain 
conservation  relations :  For  example,  corresponding  to  conservation 
of  mass, 

(.1  m.C.) •  -  0,  (A3) 

i-l 

where  m^  is  the  molecular  weight  of  species  i,  we  have 


(Ira.G. .) • 
i  1  x3 


?miJik)Gkj 


=  0. 


(A4) 


That  is,  pUjGjj  is  a  conserved  quantity.  Relation  (A4)  holds  for 
all  columsn  of  the  §  matrix.  Other  conservation  relations  of  § 
(say,  those  corresponding  to  the  conservation  of  atomic  species) 
may  be  derived  in  a  similar  fashion. 
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The  computer-based  analysis  of  chemical  engineering  plants 
is  progressing  rapidly  beyond  the  conventional  steady  state 
energy  and  material  balance  calculations.  This  increased  interest 
in  computer-based  analysis  is  due  in  particular  to:  (1)  consider¬ 
ation  of  new  chemical  processing  units  and  plants,  e.g.,  coal 
conversion  plants,  and  (2)  the  need  to  better  understand  these 
proposed  units  and  plants  because  of  the  large  costs  involved  in 
building  them.  Any  analytical  tool  that  can  be  applied  to  better 
focus  the  design  effort  is  potentially  valuable  in  avoiding 
costly  design  errors.  Thus,  there  is  increasing  interest  in 
applying  first-principle  models,  i.e.,  models  based  on  fundamental 
conservation  principles,  and  thermodynamics,  kinetics  and  transport 
processes,  which  offer  the  possibility  of  a  fundamental  understand¬ 
ing  of  the  proposed  units.  Generally,  these  models  are  expressed 
as  systems  of  differential  equations,  particularly  if  unsteady 
state  (dynamic)  operation  is  to  be  considered.  Dynamic  simulation 
is  indispensable  for  the  understanding  of  the  stability  and 
controllability  of  proposed  units  and  plants. 
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When  a  first-principle  model  is  formulated,  it  typically 
involves  a  large  (greater  than  100)  system  of  nonlinear  ordinary 
differential  equations  (ODEs) .  Such  large-scale  ODE  problems 
are  demanding  in  terms  of  the  selection  and  use  of  a  numerical 
integrator.  Fortunately,  major  advances  in  the  development  and 
implementation  of  numerical  integrators  for  initial-value  ODEs 
have  been  made  in  the  past  ten  years.  Yet,  the  solution  of 
large-scale  ODE  problems  is  anything  but  routine.  The  success 
in  achieving  a  solution  with  reasonable  effort  is  still  quite 
dependent  on  the  particular  problem  system  and  the  experience  of 
the  analyst. 

In  this  paper,  we  relate  some  of  our  experiences  in  inte¬ 
grating  a  system  of  128  nonlinear  ODEs  which  were  formulated  for 
the  dynamic  simulation  of  the  plant  in  a  high-Btu  coal  gasi¬ 
fication  plant.  The  individual  units  in  the  H2  plant  model  and 
the  number  of  equations  for  each  unit  are  listed  below: 


Unit 

No.  of  PDEs 

No.  of  ODEs 

Reformer 

1 

21-point  NMOL 

Shift  reactors 

Unit  no .  1 

1 

21-point  NMOL 

Unit  no.  2 

1 

21-point  NMOL 

- 

C02  Absorber 

Absorber 

34 

-.V, 

Stripper 

10 

•-e 

Cleanup  Methanator 

1 

21 -point  NMOL 

To  the  best  of  our  knowledge ,  this  is  the  first  reported  attempt 
to  dynamically  simulate  an  entire  plant  with  interconnected  units 
modelled  both  as  discrete  (perfectly  mixed)  systems  and  distributed 
systems.  The  latter  lead  to  partial  differential  equations  (PDEs) 
which  were  approximated  by  the  numerical  method  of  lines  (NMOL) 
thereby  giving  a  final  set  of  ODEs  for  the  plant  model. 

The  128-ODE  system  has  a  complex  structure  when  viewed  in 
terms  of  its  Jacobian  matrix.  This  matrix  is  not  even  approxi¬ 
mately  banded  (it  is  nearly  lower- triangular  because  of  the  serial 
nature  of  the  H2  plant  interconnection  of  units)  and  it  is  not 
sparse.  Therefore,  if  we  conclude  that  the  ODEs  are  probably 
stiff  and  therefore  an  implicit  integrator  should  be  used,  the 
selection  of  an  integrator  designed  for  banded  or  sparse  systems 
may  not  accommodate  the  ODE  system  very  well.  In  fact,  we  found 
this  to  be  the  case  for  a  banded  integrator,  as  demonstrated  by 
unsuccessful  attempts  to  completely  integrate  the  ODE  system,  even 
with  large  computer  run  times.  Contrary  to  expectations,  we  found 
an  explicit  integrator  to  be  more  efficient  for  the  calculation  of 
a  complete  plant  transient  response  for  one  specific  set  of 
initial  conditions.  Of  course,  an  explicit  integrator  avoids: 

(1)  the  problem  of  matching  the  Jacobian  matrix  structure  of  the 
ODE  system  to  the  integrator  and  (2)  the  matrix  calculations  of 
the  implicit  integrator  which  may  be  quite  time  consuming.  However, 
the  use  of  an  explicit  integrator  was  not  the  final  answer  to 
successful  integration  of  the  ODE  system.  When  the  initial  con¬ 
ditions  were  changed,  the  explicit  integrator  failed.  Thus  we 
conclude  that  another  complicating  feature  of  large-scale  chemical 


process  dynamic  simulation  is  the  effect  of  nonlinearities.  This 
was  clearly  evident  as  we  mapped  the  Jacobian  matrix  of  the  ODE 
system  during  the  solution,  i.e.,  the  structure  changed  signifi¬ 
cantly,  and  the  magnitude  of  the  individual  elements  in  the  matrix 
also  changed,  sometimes  by  orders  of  magnitude.  The  nonlinearity 
clearly  had  a  detrimental  and  largely  unpredictable  effect  on  the 
performance  of  both  the  explicit  and  implicit  integrators  applied 
to  the  ODE  problem  system. 

As  a  result  of  this  experience,  we  would  like  to  pose  a 
series  of  questions  for  the  numerical  analysts  and  developers  of 
ODE  integrators: 

(1)  As  users  of  ODE  integrators,  rather  than  developers  of 
integrators,  can  we  reduce  the  often  painful  learning  experience 
apparently  required  for  each  new,  large-scale  problem,  e.g.,  the 
performance,  or  lack  of  performance  of  an  integrator,  is  typically 
determined  by  the  parameters  selected  by  the  user  when  the 
integrator  is  called  such  as  the  type  of  error  and  its  magnitude, 
minimum  step  size,  etc.? 

(2)  How  can  integrators  be  selected  more  rationally  than 
just  by  trial  and  error?  Will  sparse  matrix  integration  help? 

(3)  How  does  nonlinearity  affect  the  stability  of  the 
calculation,  i.e.,  the  apparent  stiffness? 

We  have  some  limited  answers  to  these  questions: 

(1)  Look  at  the  structure  of  the  ODE  system  in  terms  of  a 
map  of  the  Jacobian  matrix.  This  picture  of  the  structure  will 
assist  in  selecting  an  implicit  integrator  that  processes  a  Jacobian 


matrix  with  approximately  the  same  structure  as  the  ODE  problem 
system. 

(2)  Look  at  the  derivatives  of  the  system  in  detail. 
Specifically/  print  all  of  the  temporal  derivatives/  and  the  spatial 
derivatives  in  the  case  of  PDE  models/  corresponding  to  the  initial 
conditions.  This  will  detect  a  mismatch  between  the  problem  time 
scale  and  the  time  scale  defined  for  the  integrator.  If  the 
magnitudes  of  the  temporal  derivatives  look  unreasonable,  print 

the  magnitudes  of  the  individual  terms  which  comprise  the  deriva¬ 
tives.  In  general,  the  derivatives  dictate  what  the  solution  will 
do.  Therefore,  a  detailed  understanding  of  the  derivatives  is 
indispensable  in  finding  errors  and  determining  why  problems  exist 
with  the  numerical  integration. 

(3)  Compute  and  print  the  temporal  eigenvalues  of  the 
linearized  ODE  system.  This  gives  a  direct  indication  of  the 
stability  of  the  simulation  around  the  point  of  linearization  (the 
real  parts  of  the  eigenvalues  must  be  either  zero  or  negative) . 

If  (1)  and  (3)  are  repeated  during  the  course  of  the  calculation, 
changes  in  the  Jacobian  matrix  and  stiffness  due  to  nonlinearities 
can  be  observed  as  the  solution  evolves. 

(4)  If  an  implicit  integrator  is  selected,  use  the  map  of 
part  (1)  to  reorder  the  ODE  to  compress  the  apparent  bandwidth  of 
the  Jacobian  matrix.  This  reordering  can  generally  be  devised  from 
consideration  of  the  physical  problem  as  well  as  from  a  priori 
consideration  of  the  Jacobian  matrix  map. 

We  have  found  that  these  limited  answers  often  make  the 
difference  between  an  unsuccessful  solution  of  the  problem  ODE 
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